
COMPOSITIONS AND METHODS FOR THE THERAPY AND DIAGNOSIS OF BREAST CANCER

TECHNICAL FIELD OF THE INVENTION

The present invention relates generally to therapy and diagnosis of
cancer, such as breast cancer. The invention is more specifically related to
polypeptides, comprising at least a portion of a breast tumor protein, and to
polynucleotides encoding such polypeptides. Such polypeptides and polynucleotides
are useful in pharmaceutical compositions, e.g., vaccines, and other compositions for
the diagnosis and treatment of breast cancer.

BACKGROUND OF THE INVENTION 
Field of the Invention 

Breast cancer is a significant health problem for women in the United
States and throughout the world. Although advances have been made in detection and
treatment of the disease, breast cancer remains the second leading cause of
cancer-related deaths in women, affecting more than 180,000 women in the United States
each year. For women in North America, the lifetime odds of getting breast cancer are
one in eight.

Description of the Related Art 

No vaccine or other universally successful method for the prevention or
treatment of breast cancer is currently available. Management of the disease currently
relies on a combination of early diagnosis (through routine breast screening procedures)
and aggressive treatment, which may include one or more of a variety of treatments
such as surgery, radiotherapy, chemotherapy and hormone therapy. The course of
treatment for a particular breast cancer is often selected based on a variety of prognostic
parameters, including an analysis of specific tumor markers. See, e.g., Porter-Jordan
and Lippman, Breast Cancer 8:73-100 (1994). However, the use of established markers
often leads to a result that is difficult to interpret, and the high mortality observed in
breast cancer patients indicates that improvements are needed in the treatment,
diagnosis and prevention of the disease.

Accordingly, there is a need in the art for improved methods for the
treatment and diagnosis of breast cancer. The present invention fulfills these needs and
further provides other related advantages.

SUMMARY OF THE INVENTION 
In one aspect, the present invention provides polynucleotide
compositions comprising a sequence selected from the group consisting of:

(a) sequences provided in SEQ ID NO:1-61, 63-175, 178, 180, 182-468,
474, 476, 477, 479, 482, 484, 486, 489-492, 504-506, 510-513, 520-533, 548-550,
564, 566-569, and 576;

(b) complements of the sequences provided in SEQ ID NO:1-61, 63-175,
178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486, 489-492, 504-506, 510-513,
520-533, 548-550, 564, 566-569, and 576;

(c) sequences consisting of at least 20 contiguous residues of a
sequence provided in SEQ ID NO:1-61, 63-175, 178, 180, 182-468, 474, 476, 477, 479,
482, 484, 486, 489-492, 504-506, 510-513, 520-533, 548-550, 564, 566-569, and 576;

(d) sequences that hybridize to a sequence provided in SEQ ID
NO:1-61, 63-175, 178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486, 489-492,
504-506, 510-513, 520-533, 548-550, 564, 566-569, and 576, under moderately stringent
conditions;

(e) sequences having at least 75% identity to a sequence of SEQ ID
NO:1-61, 63-175, 178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486, 489-492,
504-506, 510-513, 520-533, 548-550, 564, 566-569, and 576;

(f) sequences having at least 90% identity to a sequence of SEQ ID
NO:1-61, 63-175, 178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486, 489-492,
504-506, 510-513, 520-533, 548-550, 564, 566-569, and 576; and

(g) degenerate variants of a sequence provided in SEQ ID NO:1-61,
63-175, 178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486, 489-492, 504-506,
510-513, 520-533, 548-550, 564, 566-569, and 576.

In one preferred embodiment, the polynucleotide compositions of the
invention are expressed in at least about 20%, more preferably in at least about 30%,
and most preferably in at least about 50% of breast tumor samples tested, at a level
that is at least about 2-fold, preferably at least about 5-fold, and most preferably at least
about 10-fold higher than that for normal tissues.
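
As an illustration only, the frequency and fold-overexpression thresholds recited above can be expressed as a small check. The function names, the input layout (one expression value per tumor sample and a single normal-tissue reference level) and the sample values are assumptions made for this sketch; the specification does not prescribe a particular calculation at this point.

```python
from typing import Sequence


def fraction_overexpressing(tumor_levels: Sequence[float],
                            normal_level: float,
                            fold: float = 2.0) -> float:
    """Fraction of tumor samples whose level is at least `fold` x normal."""
    if normal_level <= 0:
        raise ValueError("normal_level must be positive")
    if not tumor_levels:
        raise ValueError("at least one tumor sample is required")
    hits = sum(1 for level in tumor_levels if level >= fold * normal_level)
    return hits / len(tumor_levels)


def meets_criterion(tumor_levels: Sequence[float],
                    normal_level: float,
                    min_fraction: float = 0.20,
                    fold: float = 2.0) -> bool:
    """True when enough samples overexpress the sequence by the given fold."""
    return fraction_overexpressing(tumor_levels, normal_level, fold) >= min_fraction


# Hypothetical expression values (arbitrary units), not data from the disclosure.
tumors = [1.0, 2.5, 12.0, 0.8, 6.0]
normal = 1.0
print(meets_criterion(tumors, normal, min_fraction=0.50, fold=2.0))
print(meets_criterion(tumors, normal, min_fraction=0.50, fold=10.0))
```

Here "at least about" is treated as a plain greater-than-or-equal comparison, which is a simplification.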

The present invention, in another aspect, provides polypeptide
compositions comprising an amino acid sequence that is encoded by a polynucleotide
sequence described above.

The present invention further provides polypeptide compositions
comprising an amino acid sequence selected from the group consisting of sequences
recited in SEQ ID NO:62, 176, 179, 181, 469-473, 475, 478, 483, 485, 487, 488,
493-503, 507-509, 514-519, 534-547, 551-553, 565, 570-573, and 577-627.
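
Because the polynucleotide and polypeptide sequence identifiers are recited as long lists of ranges, it can help to expand them programmatically. The following is a minimal sketch; the helper name `parse_seq_ids` and the idea of checking that the two lists do not overlap are illustrative assumptions, not part of the disclosure. The range strings are copied from the lists recited above.

```python
from typing import Set


def parse_seq_ids(spec: str) -> Set[int]:
    """Expand a string such as '1-3, 7' into the set {1, 2, 3, 7}."""
    ids: Set[int] = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            low, high = (int(x) for x in part.split("-"))
            ids.update(range(low, high + 1))
        else:
            ids.add(int(part))
    return ids


POLYNUCLEOTIDE_IDS = parse_seq_ids(
    "1-61, 63-175, 178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486, "
    "489-492, 504-506, 510-513, 520-533, 548-550, 564, 566-569, 576"
)
POLYPEPTIDE_IDS = parse_seq_ids(
    "62, 176, 179, 181, 469-473, 475, 478, 483, 485, 487, 488, 493-503, "
    "507-509, 514-519, 534-547, 551-553, 565, 570-573, 577-627"
)

# The two lists should not share any identifier.
print(POLYNUCLEOTIDE_IDS.isdisjoint(POLYPEPTIDE_IDS))
```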

In certain preferred embodiments, the polypeptides and/or
polynucleotides of the present invention are immunogenic, i.e., they are capable of
eliciting an immune response, particularly a humoral and/or cellular immune response,
as further described herein.

The present invention further provides fragments, variants and/or
derivatives of the disclosed polypeptide and/or polynucleotide sequences, wherein the
fragments, variants and/or derivatives preferably have a level of immunogenic activity
of at least about 50%, preferably at least about 70% and more preferably at least about
90% of the level of immunogenic activity of a polypeptide sequence set forth in SEQ ID
NO:62, 176, 179, 181, 469-473, 475, 478, 483, 485, 487, 488, 493-503, 507-509,
514-519, 534-547, 551-553, 565, 570-573, and 577-627 or a polypeptide sequence encoded
by a polynucleotide sequence set forth in SEQ ID NO:1-61, 63-175, 178, 180, 182-468,
474, 476, 477, 479, 482, 484, 486, 489-492, 504-506, 510-513, 520-533, 548-550, 564,
566-569, and 576.

The present invention further provides polynucleotides that encode a
polypeptide described above, expression vectors comprising such polynucleotides and
host cells transformed or transfected with such expression vectors.

Within other aspects, the present invention provides pharmaceutical
compositions comprising a polypeptide or polynucleotide as described above and a
physiologically acceptable carrier.

Within a related aspect of the present invention, the pharmaceutical
compositions, e.g., vaccine compositions, are provided for prophylactic or therapeutic
applications. Such compositions generally comprise an immunogenic polypeptide or
polynucleotide of the invention and an immunostimulant, such as an adjuvant.

The present invention further provides pharmaceutical compositions that
comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to
a polypeptide of the present invention, or a fragment thereof; and (b) a physiologically
acceptable carrier.

Within further aspects, the present invention provides pharmaceutical
compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as
described above and (b) a pharmaceutically acceptable carrier or excipient. Illustrative
antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts
and B cells.

Within related aspects, pharmaceutical compositions are provided that
comprise: (a) an antigen presenting cell that expresses a polypeptide as described above
and (b) an immunostimulant.

The present invention further provides, in other aspects, fusion proteins
that comprise at least one polypeptide as described above, as well as polynucleotides
encoding such fusion proteins, typically in the form of pharmaceutical compositions,
e.g., vaccine compositions, comprising a physiologically acceptable carrier and/or an
immunostimulant. The fusion proteins may comprise multiple immunogenic
polypeptides or portions/variants thereof as described herein, and may further comprise
one or more polypeptide segments for facilitating the expression, purification and/or
immunogenicity of the polypeptide(s).

The present invention further provides, in other aspects, fusion proteins
that comprise at least one polypeptide as described above, as well as polynucleotides
encoding such fusion proteins. Exemplary fusion proteins according to the present
invention comprise a first amino acid portion and a second amino acid portion wherein
the first amino acid portion includes 9 or more contiguous amino acids from
mammaglobin as depicted by amino acids 1-93 of SEQ ID NO:493 (SEQ ID NO:503);
wherein the second amino acid portion includes 9 or more contiguous amino acids from
B726P as depicted by SEQ ID NO:475, SEQ ID NO:469, or SEQ ID NO:176; and
wherein the first amino acid portion is connected to either the amino-terminal or
carboxy-terminal end of the second amino acid portion.

Still further embodiments of the present invention provide fusion
proteins wherein said first amino acid portion is selected from the group consisting of:
IDELKECFLNQTDETLSNVE (SEQ ID NO:496; amino acids 59-78 of SEQ ID
NO:493); TTNAIDELKECFLNQ (SEQ ID NO:497; amino acids 55-69 of SEQ ID
NO:493); SQHCYAGSGCPLLENVISKTI (SEQ ID NO:498; amino acids 13-33 of
SEQ ID NO:493); EYKELLQEFIDDNATTNAID (SEQ ID NO:499; amino acids 41-60
of SEQ ID NO:493); KLLMVLMLA (SEQ ID NO:500; amino acids 2-10 of SEQ ID
NO:493); QEFIDDNATTNAI (SEQ ID NO:501; amino acids 47-59 of SEQ ID
NO:493); LKECFLNQTDETL (SEQ ID NO:502; amino acids 62-74 of SEQ ID
NO:493), and any one of the amino acid sequences set forth in SEQ ID NO:578-593.
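
The peptides recited above are each tied to a residue range of SEQ ID NO:493, and several of those ranges overlap. A short consistency check can confirm that each peptide has the length implied by its range and that overlapping peptides agree at every shared position. This is only a sketch: the data structure and function names are hypothetical, the full SEQ ID NO:493 sequence is not used, and the final residue of SEQ ID NO:498 is read here as isoleucine (I), which is an assumption about a character that is hard to distinguish in print.

```python
from itertools import combinations

# SEQ ID NO -> (peptide, first residue, last residue); positions are
# 1-based and inclusive, relative to SEQ ID NO:493, as recited in the text.
PEPTIDES = {
    496: ("IDELKECFLNQTDETLSNVE", 59, 78),
    497: ("TTNAIDELKECFLNQ", 55, 69),
    498: ("SQHCYAGSGCPLLENVISKTI", 13, 33),  # last residue assumed to be I
    499: ("EYKELLQEFIDDNATTNAID", 41, 60),
    500: ("KLLMVLMLA", 2, 10),
    501: ("QEFIDDNATTNAI", 47, 59),
    502: ("LKECFLNQTDETL", 62, 74),
}


def overlap_agrees(a, b) -> bool:
    """True when two positioned peptides match wherever their ranges overlap."""
    (seq_a, start_a, end_a), (seq_b, start_b, end_b) = a, b
    low, high = max(start_a, start_b), min(end_a, end_b)
    if low > high:
        return True  # no shared positions, nothing to compare
    part_a = seq_a[low - start_a: high - start_a + 1]
    part_b = seq_b[low - start_b: high - start_b + 1]
    return part_a == part_b


def find_problems(peptides) -> list:
    """Return human-readable descriptions of any inconsistencies found."""
    problems = []
    for seq_id, (seq, start, end) in peptides.items():
        if len(seq) != end - start + 1:
            problems.append(f"SEQ ID NO:{seq_id}: length does not match range")
    for (id_a, a), (id_b, b) in combinations(peptides.items(), 2):
        if not overlap_agrees(a, b):
            problems.append(f"SEQ ID NO:{id_a} and {id_b} disagree in overlap")
    return problems


print(find_problems(PEPTIDES))
```

An empty result means the recited peptides and coordinates are mutually consistent; it does not by itself verify them against the full-length sequence.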

Alternative embodiments provide fusion proteins wherein the second
amino acid portion includes 9 or more contiguous amino acids encoded by (1) the
combined upstream and downstream open reading frame (ORF) of B726P as depicted in
SEQ ID NO:475; (2) the upstream ORF of B726P as depicted in SEQ ID NO:469; and
(3) the downstream ORF of B726P as depicted in SEQ ID NO:176. Fusion proteins
according to the present invention may also comprise a second amino acid portion that
includes 9 or more contiguous amino acids from the amino acid sequence depicted by
amino acids 1-129 of SEQ ID NO:475. Still additional exemplary fusion proteins are
depicted herein by SEQ ID NO:493, SEQ ID NO:494, and SEQ ID NO:495.

Fusion proteins are provided wherein the mammaglobin amino acid
portion is connected to the amino-terminus of the B726P amino acid portion while other
fusion proteins are provided wherein the mammaglobin amino acid portion is connected
to the carboxy-terminus of the B726P amino acid portion. The connection between the
mammaglobin amino acid portion and the B726P portion may be a covalent bond.
Additionally, a stretch of amino acids either unrelated or related to either mammaglobin
and/or B726P may be incorporated between or either amino- or carboxy-terminal to
either the mammaglobin and/or B726P amino acid portion.

The present invention also provides isolated polynucleotides that encode
any of the fusion proteins that are specifically disclosed herein as well as those fusion
proteins that may be accomplished with routine experimentation by the ordinarily
skilled artisan.

Within further aspects, the present invention provides methods for
stimulating an immune response in a patient, preferably a T cell response in a human
patient, comprising administering a pharmaceutical composition described herein. The
patient may be afflicted with breast cancer, in which case the methods provide treatment
for the disease, or a patient considered at risk for such a disease may be treated
prophylactically.

Within further aspects, the present invention provides methods for
inhibiting the development of a cancer in a patient, comprising administering to a
patient a pharmaceutical composition as recited above. The patient may be afflicted
with breast cancer, in which case the methods provide treatment for the disease, or a
patient considered at risk for such a disease may be treated prophylactically.

The present invention further provides, within other aspects, methods for
removing tumor cells from a biological sample, comprising contacting a biological
sample with T cells that specifically react with a polypeptide of the present invention,
wherein the step of contacting is performed under conditions and for a time sufficient to
permit the removal of cells expressing the protein from the sample.

Within related aspects, methods are provided for inhibiting the
development of a cancer in a patient, comprising administering to a patient a biological
sample treated as described above.

Methods are further provided, within other aspects, for stimulating
and/or expanding T cells specific for a polypeptide of the present invention, comprising
contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a
polynucleotide encoding such a polypeptide; and/or (iii) an antigen presenting cell that
expresses such a polypeptide; under conditions and for a time sufficient to permit the
stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells
prepared as described above are also provided.

Within further aspects, the present invention provides methods for
inhibiting the development of a cancer in a patient, comprising administering to a
patient an effective amount of a T cell population as described above.

The present invention further provides methods for inhibiting the
development of a cancer in a patient, comprising the steps of: (a) incubating CD4+
and/or CD8+ T cells isolated from a patient with one or more of: (i) a polypeptide
comprising at least an immunogenic portion of a polypeptide disclosed herein; (ii) a
polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that
expresses such a polypeptide; and (b) administering to the patient an effective amount
of the proliferated T cells, and thereby inhibiting the development of a cancer in the
patient. Proliferated cells may, but need not, be cloned prior to administration to the
patient.

Within further aspects, the present invention provides methods for
determining the presence or absence of a cancer, preferably a breast cancer, in a patient
comprising: (a) contacting a biological sample obtained from a patient with a binding
agent that binds to a polypeptide as recited above; (b) detecting in the sample an amount
of polypeptide that binds to the binding agent; and (c) comparing the amount of
polypeptide with a predetermined cut-off value, and therefrom determining the presence
or absence of a cancer in the patient. Within preferred embodiments, the binding agent
is an antibody, more preferably a monoclonal antibody.
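
Step (c) of the method above reduces to comparing a measured signal with a predetermined cut-off value. The sketch below shows one way such a comparison could be coded. The way the cut-off is derived here (mean of cancer-free control signals plus a multiple of their standard deviation) is an assumption chosen for illustration, as are the function names and the sample readings; this section of the specification does not state how the cut-off is set.

```python
from statistics import mean, stdev
from typing import Sequence


def cutoff_from_controls(control_signals: Sequence[float],
                         n_sd: float = 3.0) -> float:
    """Hypothetical cut-off: control mean plus n_sd sample standard deviations."""
    if len(control_signals) < 2:
        raise ValueError("need at least two control samples")
    return mean(control_signals) + n_sd * stdev(control_signals)


def is_positive(sample_signal: float, cutoff: float) -> bool:
    """True when the detected amount exceeds the predetermined cut-off."""
    return sample_signal > cutoff


# Hypothetical assay readings (e.g. absorbance units), for illustration only.
controls = [0.10, 0.12, 0.11, 0.09, 0.13]
cutoff = cutoff_from_controls(controls)
print(is_positive(0.45, cutoff))
print(is_positive(0.12, cutoff))
```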

The present invention also provides, within other aspects, methods for
monitoring the progression of a cancer in a patient. Such methods comprise the steps
of: (a) contacting a biological sample obtained from a patient at a first point in time
with a binding agent that binds to a polypeptide as recited above; (b) detecting in the
sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a)
and (b) using a biological sample obtained from the patient at a subsequent point in
time; and (d) comparing the amount of polypeptide detected in step (c) with the amount
detected in step (b) and therefrom monitoring the progression of the cancer in the
patient.
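
The comparison in step (d) can be sketched as a relative change between the two measurements. The function name, the labels returned and the 10% tolerance used to absorb assay noise are all assumptions for this example; how a given change maps onto clinical progression is not specified in this passage and is not decided by the code.

```python
def compare_time_points(first_amount: float,
                        later_amount: float,
                        tolerance: float = 0.10) -> str:
    """Classify the change between two measurements of the same marker.

    `tolerance` is a hypothetical relative band within which the two
    amounts are treated as unchanged.
    """
    if first_amount <= 0:
        raise ValueError("first_amount must be positive")
    relative_change = (later_amount - first_amount) / first_amount
    if relative_change > tolerance:
        return "increased"
    if relative_change < -tolerance:
        return "decreased"
    return "stable"


# Hypothetical marker amounts measured at two visits.
print(compare_time_points(1.0, 1.5))
print(compare_time_points(1.0, 1.05))
```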

The present invention further provides, within other aspects, methods for
determining the presence or absence of a cancer in a patient, comprising the steps of: (a)
contacting a biological sample obtained from a patient with an oligonucleotide that
hybridizes to a polynucleotide that encodes a polypeptide of the present invention; (b)
detecting in the sample a level of a polynucleotide, preferably mRNA, that hybridizes to
the oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the
oligonucleotide with a predetermined cut-off value, and therefrom determining the
presence or absence of a cancer in the patient. Within certain embodiments, the amount
of mRNA is detected via polymerase chain reaction using, for example, at least one
oligonucleotide primer that hybridizes to a polynucleotide encoding a polypeptide as
recited above, or a complement of such a polynucleotide. Within other embodiments,
the amount of mRNA is detected using a hybridization technique, employing an
oligonucleotide probe that hybridizes to a polynucleotide that encodes a polypeptide as
recited above, or a complement of such a polynucleotide.

In related aspects, methods are provided for monitoring the progression
of a cancer in a patient, comprising the steps of: (a) contacting a biological sample
obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that
encodes a polypeptide of the present invention; (b) detecting in the sample an amount of
a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b)
using a biological sample obtained from the patient at a subsequent point in time; and
(d) comparing the amount of polynucleotide detected in step (c) with the amount
detected in step (b) and therefrom monitoring the progression of the cancer in the
patient.

Within further aspects, the present invention provides antibodies, such as
monoclonal antibodies, that bind to a polypeptide as described above, as well as
diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more
oligonucleotide probes or primers as described above are also provided.

These and other aspects of the present invention will become apparent
upon reference to the following detailed description and attached drawings. All
references disclosed herein are hereby incorporated by reference in their entirety as if
each was incorporated individually.

BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE IDENTIFIERS

Fig. 1 shows the results of a Northern blot of the clone SYN18C6 (SEQ
ID NO:40).

Fig. 2 shows the results of an IFN-gamma ELISPOT assay demonstrating
that the B726P-specific CTL clone recognizes and lyses breast tumor cell lines
expressing B726P.

SEQ ID NO:1 is the determined cDNA sequence of JBT2.

SEQ ID NO:2 is the determined cDNA sequence of JBT6.

SEQ ID NO:3 is the determined cDNA sequence of JBT7.

SEQ ID NO:4 is the determined cDNA sequence of JBT10.

SEQ ID NO:5 is the determined cDNA sequence of JBT13.

SEQ ID NO:6 is the determined cDNA sequence of JBT14.

SEQ ID NO:7 is the determined cDNA sequence of JBT15.

SEQ ID NO:8 is the determined cDNA sequence of JBT16.

SEQ ID NO:9 is the determined cDNA sequence of JBT17.

SEQ ID NO:10 is the determined cDNA sequence of JBT22.

SEQ ID NO:11 is the determined cDNA sequence of JBT25.

SEQ ID NO:12 is the determined cDNA sequence of JBT28.

SEQ ID NO:13 is the determined cDNA sequence of JBT32.

SEQ ID NO:14 is the determined cDNA sequence of JBT33.

SEQ ID NO:15 is the determined cDNA sequence of JBT34.

SEQ ID NO:16 is the determined cDNA sequence of JBT36.

SEQ ID NO:17 is the determined cDNA sequence of JBT37.

SEQ ID NO:18 is the determined cDNA sequence of JBT51.

SEQ ID NO:19 is the determined cDNA sequence of JBTT1.

SEQ ID NO:20 is the determined cDNA sequence of JBTT7.

SEQ ID NO:21 is the determined cDNA sequence of JBTT11.

SEQ ID NO:22 is the determined cDNA sequence of JBTT14.

SEQ ID NO:23 is the determined cDNA sequence of JBTT18.

SEQ ID NO:24 is the determined cDNA sequence of JBTT19.

SEQ ID NO:25 is the determined cDNA sequence of JBTT20.

SEQ ID NO:26 is the determined cDNA sequence of JBTT21.

SEQ ID NO:27 is the determined cDNA sequence of JBTT22.

SEQ ID NO:28 is the determined cDNA sequence of JBTT28.

SEQ ID NO:29 is the determined cDNA sequence of JBTT29.

SEQ ID NO:30 is the determined cDNA sequence of JBTT33.

SEQ ID NO:31 is the determined cDNA sequence of JBTT37.

SEQ ID NO:32 is the determined cDNA sequence of JBTT38.

SEQ ID NO:33 is the determined cDNA sequence of JBTT47.

SEQ ID NO:34 is the determined cDNA sequence of JBTT48.

SEQ ID NO:35 is the determined cDNA sequence of JBTT50.

SEQ ID NO:36 is the determined cDNA sequence of JBTT51.

SEQ ID NO:37 is the determined cDNA sequence of JBTT52.

SEQ ID NO:38 is the determined cDNA sequence of JBTT54.

SEQ ID NO:39 is the determined cDNA sequence of SYN17F4.

SEQ ID NO:40 is the determined cDNA sequence of SYN18C6 (also
known as B709P).

SEQ ID NO:41 is the determined cDNA sequence of SYN19A2.

SEQ ID NO:42 is the determined cDNA sequence of SYN19C8.

SEQ ID NO:43 is the determined cDNA sequence of SYN20A12.

SEQ ID NO:44 is the determined cDNA sequence of SYN20G6.

SEQ ID NO:45 is the determined cDNA sequence of SYN20G6-2.

SEQ ID NO:46 is the determined cDNA sequence of SYN21B9.

SEQ ID NO:47 is the determined cDNA sequence of SYN21B9-2.

SEQ ID NO:48 is the determined cDNA sequence of SYN21C10.

SEQ ID NO:49 is the determined cDNA sequence of SYN21G10.

SEQ ID NO:50 is the determined cDNA sequence of SYN21G10-2.

SEQ ID NO:51 is the determined cDNA sequence of SYN21G11.

SEQ ID NO:52 is the determined cDNA sequence of SYN21G11-2.

SEQ ID NO:53 is the determined cDNA sequence of SYN21H8.

SEQ ID NO:54 is the determined cDNA sequence of SYN22A10.

SEQ ID NO:55 is the determined cDNA sequence of SYN22A10-2.

SEQ ID NO:56 is the determined cDNA sequence of SYN22A12.

SEQ ID NO:57 is the determined cDNA sequence of SYN22A2 (also
referred to as B718P).

SEQ ID NO:58 is the determined cDNA sequence of SYN22B4.

SEQ ID NO:59 is the determined cDNA sequence of SYN22C2.

SEQ ID NO:60 is the determined cDNA sequence of SYN22E10.

SEQ ID NO:61 is the determined cDNA sequence of SYN22F2.

SEQ ID NO:62 is a predicted amino acid sequence for SYN18C6 (also
known as B709P).

SEQ ID NO:63 is the determined cDNA sequence of B723P.

SEQ ID NO:64 is the determined cDNA sequence for B724P.

SEQ ID NO:65 is the determined cDNA sequence of B770P.

SEQ ID NO:66 is the determined cDNA sequence of B716P.

SEQ ID NO:67 is the determined cDNA sequence of B725P.

SEQ ID NO:68 is the determined cDNA sequence of B717P.

SEQ ID NO:69 is the determined cDNA sequence of B771P.

SEQ ID NO:70 is the determined cDNA sequence of B722P.

SEQ ID NO:71 is the determined cDNA sequence of B726P.

SEQ ID NO:72 is the determined cDNA sequence of B727P.

SEQ ID NO:73 is the determined cDNA sequence of B728P.

SEQ ID NO:74-87 are the determined cDNA sequences of isolated
clones which show homology to known sequences.

SEQ ID NO:88 is the determined cDNA sequence of 13053.

SEQ ID NO:89 is the determined cDNA sequence of 13057.

SEQ ID NO:90 is the determined cDNA sequence of 13059.

SEQ ID NO:91 is the determined cDNA sequence of 13065.

SEQ ID NO:92 is the determined cDNA sequence of 13067.

SEQ ID NO:93 is the determined cDNA sequence of 13068.

SEQ ID NO:94 is the determined cDNA sequence of 13071.

SEQ ID NO:95 is the determined cDNA sequence of 13072.

SEQ ID NO:96 is the determined cDNA sequence of 13073.

SEQ ID NO:97 is the determined cDNA sequence of 13075.

SEQ ID NO:98 is the determined cDNA sequence of 13078.

SEQ ID NO:99 is the determined cDNA sequence of 13079.

SEQ ID NO:100 is the determined cDNA sequence of 13081.

SEQ ID NO:101 is the determined cDNA sequence of 13082.

SEQ ID NO:102 is the determined cDNA sequence of 13092.

SEQ ID NO:103 is the determined cDNA sequence of 13097.

SEQ ID NO:104 is the determined cDNA sequence of 13101.

SEQ ID NO:105 is the determined cDNA sequence of 13102.

SEQ ID NO:106 is the determined cDNA sequence of 13119.

SEQ ID NO:107 is the determined cDNA sequence of 13131.

SEQ ID NO:108 is the determined cDNA sequence of 13133.

SEQ ID NO:109 is the determined cDNA sequence of 13135.

SEQ ID NO:110 is the determined cDNA sequence of 13139.

SEQ ID NO:111 is the determined cDNA sequence of 13140.

SEQ ID NO:112 is the determined cDNA sequence of 13146.

SEQ ID NO:113 is the determined cDNA sequence of 13147.

SEQ ID NO:114 is the determined cDNA sequence of 13148.

SEQ ID NO:115 is the determined cDNA sequence of 13149.

SEQ ID NO:116 is the determined cDNA sequence of 13151.

SEQ ID NO:117 is the determined cDNA sequence of 13051.

SEQ ID NO:118 is the determined cDNA sequence of 13052.

SEQ ID NO:119 is the determined cDNA sequence of 13055.

SEQ ID NO:120 is the determined cDNA sequence of 13058.

SEQ ID NO:121 is the determined cDNA sequence of 13062.

SEQ ID NO:122 is the determined cDNA sequence of 13064.

SEQ ID NO:123 is the determined cDNA sequence of 13080.

SEQ ID NO:124 is the determined cDNA sequence of 13093.

SEQ ID NO:125 is the determined cDNA sequence of 13094.

SEQ ID NO:126 is the determined cDNA sequence of 13095.

SEQ ID NO:127 is the determined cDNA sequence of 13096.

SEQ ID NO:128 is the determined cDNA sequence of 13099.

SEQ ID NO:129 is the determined cDNA sequence of 13100.

SEQ ID NO:130 is the determined cDNA sequence of 13103.

SEQ ID NO:131 is the determined cDNA sequence of 13106.

SEQ ID NO:132 is the determined cDNA sequence of 13107.

SEQ ID NO:133 is the determined cDNA sequence of 13108.

SEQ ID NO:134 is the determined cDNA sequence of 13121.

SEQ ID NO:135 is the determined cDNA sequence of 13126.

SEQ ID NO:136 is the determined cDNA sequence of 13129.

SEQ ID NO:137 is the determined cDNA sequence of 13130.

SEQ ID NO:138 is the determined cDNA sequence of 13134.

SEQ ID NO:139 is the determined cDNA sequence of 13141.

SEQ ID NO:140 is the determined cDNA sequence of 13142.

SEQ ID NO:141 is the determined cDNA sequence of 14376.

SEQ ID NO:142 is the determined cDNA sequence of 14377.

SEQ ID NO:143 is the determined cDNA sequence of 14383.

SEQ ID NO:144 is the determined cDNA sequence of 14384.

SEQ ID NO:145 is the determined cDNA sequence of 14387.

SEQ ID NO:146 is the determined cDNA sequence of 14392.

SEQ ID NO:147 is the determined cDNA sequence of 14394.

SEQ ID NO:148 is the determined cDNA sequence of 14398.

SEQ ID NO:149 is the determined cDNA sequence of 14401.

SEQ ID NO:150 is the determined cDNA sequence of 14402.

SEQ ID NO:151 is the determined cDNA sequence of 14405.

SEQ ID NO:152 is the determined cDNA sequence of 14409.

SEQ ID NO:153 is the determined cDNA sequence of 14412.

SEQ ID NO:154 is the determined cDNA sequence of 14414.

SEQ ID NO:155 is the determined cDNA sequence of 14415.

SEQ ID NO:156 is the determined cDNA sequence of 14416.

SEQ ID NO:157 is the determined cDNA sequence of 14419.

SEQ ID NO:158 is the determined cDNA sequence of 14426.

SEQ ID NO:159 is the determined cDNA sequence of 14427.

SEQ ID NO:160 is the determined cDNA sequence of 14375.

SEQ ID NO:161 is the determined cDNA sequence of 14378.

SEQ ID NO:162 is the determined cDNA sequence of 14379.

SEQ ID NO:163 is the determined cDNA sequence of 14380.

SEQ ID NO:164 is the determined cDNA sequence of 14381.

SEQ ID NO:165 is the determined cDNA sequence of 14382.

SEQ ID NO:166 is the determined cDNA sequence of 14388.

SEQ ID NO:167 is the determined cDNA sequence of 14399.

SEQ ID NO:168 is the determined cDNA sequence of 14406.

SEQ ID NO:169 is the determined cDNA sequence of 14407.

SEQ ID NO:170 is the determined cDNA sequence of 14408.

SEQ ID NO:171 is the determined cDNA sequence of 14417.

SEQ ID NO:172 is the determined cDNA sequence of 14418.

SEQ ID NO:173 is the determined cDNA sequence of 14423.

SEQ ID NO:174 is the determined cDNA sequence of 14424.

SEQ ID NO:175 is the determined cDNA sequence of B726P-20.

SEQ ID NO:176 is the predicted amino acid sequence of B726P-20 (also
referred to as B726P downstream ORF).

SEQ ID NO:177 is a PCR primer.

SEQ ID NO:178 is the determined cDNA sequence of B726P-74.

SEQ ID NO:179 is the predicted amino acid sequence of B726P-74.

SEQ ID NO:180 is the determined cDNA sequence of B726P-79.

SEQ ID NO:181 is the predicted amino acid sequence of B726P-79.

SEQ ID NO:182 is the determined cDNA sequence of 19439.1, showing
homology to the mammaglobin gene.

SEQ ID NO:183 is the determined cDNA sequence of 19407.1, showing
homology to the human keratin gene.

SEQ ID NO:184 is the determined cDNA sequence of 19428.1, showing
homology to human chromosome 17 clone.

SEQ ID NO:185 is the determined cDNA sequence of B808P (19408),
showing no significant homology to any known gene.

SEQ ID NO:186 is the determined cDNA sequence of 19460.1, showing
no significant homology to any known gene.

SEQ ID NO:187 is the determined cDNA sequence of 19419.1, showing
homology to Ig kappa light chain.

SEQ ID NO:188 is the determined cDNA sequence of 19411.1, showing
homology to human alpha-1 collagen.

SEQ ID NO:189 is the determined cDNA sequence of 19420.1, showing
homology to Mus musculus proteinase-3.

SEQ ID NO:190 is the determined cDNA sequence of 19432.1, showing
homology to human high motility group box.

SEQ ID NO:191 is the determined cDNA sequence of 19412.1, showing
homology to the human plasminogen activator gene.

SEQ ID NO:192 is the determined cDNA sequence of 19415.1, showing
homology to mitogen activated protein kinase.

SEQ ID NO:193 is the determined cDNA sequence of 19409.1, showing
homology to the chondroitin sulfate proteoglycan protein.

SEQ ID NO:194 is the determined cDNA sequence of 19406.1, showing
no significant homology to any known gene.

SEQ ID NO:195 is the determined cDNA sequence of 19421.1, showing
homology to human fibronectin.

SEQ ID NO:196 is the determined cDNA sequence of 19426.1, showing
homology to the retinoic acid receptor responder 3.

SEQ ID NO:197 is the determined cDNA sequence of 19425.1, showing
homology to MyD88 mRNA.

SEQ ID NO:198 is the determined cDNA sequence of 19424.1, showing
homology to peptide transporter (TAP-1) mRNA.

SEQ ID NO:199 is the determined cDNA sequence of 19429.1, showing
no significant homology to any known gene.

SEQ ID NO:200 is the determined cDNA sequence of 19435.1, showing
homology to human polymorphic epithelial mucin.

SEQ ID NO:201 is the determined cDNA sequence of B813P (19434.1),
showing homology to human GATA-3 transcription factor.

SEQ ID NO:202 is the determined cDNA sequence of 19461.1, showing
homology to the human AP-2 gene.
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SEQ ID NO:203 is the determined cDNA sequence of 19450.1, showing 
homology to DNA binding regulatory factor 

SEQ ID NO:204 is the determined cDNA sequence of 19451.1, showing 
homology to Na/H exchange regulatory co-factor 
5 SEQ ID NO:205 is the determmed cDNA sequence of 19462.1, showing 

no significant homology to any known gene 

SEQ ID NO:206 is the determined cDNA sequence of 19455.1, showing
homology to human mRNA for histone H2A.Z

SEQ ID NO:207 is the determined cDNA sequence of 19459.1, showing
homology to PAC clone 179N16

SEQ ID NO:208 is the determined cDNA sequence of 19464.1, showing
no significant homology to any known gene

SEQ ID NO:209 is the determined cDNA sequence of 19414.1, showing 
homology to lipophilin B

SEQ ID NO:210 is the determined cDNA sequence of 19413.1, showing
homology to chromosome 17 clone hRPK.209_J_20

SEQ ID NO:211 is the determined cDNA sequence of 19416.1, showing
no significant homology to any known gene

SEQ ID NO:212 is the determined cDNA sequence of 19437.1, showing
homology to human clone 24976 mRNA

SEQ ID NO:213 is the determined cDNA sequence of 19449.1, showing 
homology to mouse DNA for PG-M core protein 

SEQ ID NO:214 is the determined cDNA sequence of 19446.1, showing
no significant homology to any known gene

SEQ ID NO:215 is the determined cDNA sequence of 19452.1, showing
no significant homology to any known gene

SEQ ID NO:216 is the determined cDNA sequence of 19483.1, showing 
no significant homology to any known gene 

SEQ ID NO:217 is the determined cDNA sequence of 19526.1, showing 
homology to human lipophilin C

SEQ ID NO:218 is the determined cDNA sequence of 19484.1, showing 
homology to the secreted cement gland protein XAG-2

SEQ ID NO:219 is the determined cDNA sequence of 19470.1, showing
no significant homology to any known gene

SEQ ID NO:220 is the determined cDNA sequence of 19469.1, showing
homology to the human HLA-DM gene

SEQ ID NO:221 is the determined cDNA sequence of 19482.1, showing
homology to the human pS2 protein gene

SEQ ID NO:222 is the determined cDNA sequence of B805P (19468.1), 
showing no significant homology to any known gene 

SEQ ID NO:223 is the determined cDNA sequence of 19467.1, showing 
homology to human thrombospondin mRNA

SEQ ID NO:224 is the determined cDNA sequence of 19498.1, showing 
homology to the CDC2 gene involved in cell cycle control 

SEQ ID NO:225 is the determined cDNA sequence of 19506.1, showing
homology to human cDNA for TREB protein

SEQ ID NO:226 is the determined cDNA sequence of B806P (19505.1),
showing no significant homology to any known gene

SEQ ID NO:227 is the determined cDNA sequence of 19486.1, showing 
homology to type I epidermal keratin 

SEQ ID NO:228 is the determined cDNA sequence of 19510.1, showing 
homology to glucose transporter for glycoprotein

SEQ ID NO:229 is the determined cDNA sequence of 19512.1, showing 
homology to the human lysyl hydroxylase gene 

SEQ ID NO:230 is the determined cDNA sequence of 19511.1, showing
homology to human palmitoyl-protein thioesterase

SEQ ID NO:231 is the determined cDNA sequence of 19508.1, showing
homology to human alpha enolase

SEQ ID NO:232 is the determined cDNA sequence of B807P (19509.1), 
showing no significant homology to any known gene 

SEQ ID NO:233 is the determined cDNA sequence of B809P (19520.1), 
showing homology to clone 102D24 on chromosome 11q13.31

SEQ ID NO:234 is the determined cDNA sequence of 19507.1, showing
homology to prosome beta-subunit

SEQ ID NO:235 is the determined cDNA sequence of 19525.1, showing
homology to human pro-urokinase precursor

SEQ ID NO:236 is the determined cDNA sequence of 19513.1, showing
no significant homology to any known gene

SEQ ID NO:237 is the determined cDNA sequence of 19517.1, showing
homology to human PAC 128M19 clone

SEQ ID NO:238 is the determined cDNA sequence of 19564.1, showing 
homology to human cytochrome P450-IIB 

SEQ ID NO:239 is the determined cDNA sequence of 19553.1, showing 
homology to human GABA-A receptor pi subunit

SEQ ID NO:240 is the determined cDNA sequence of B811P (19575.1), 
showing no significant homology to any known gene 

SEQ ID NO:241 is the determined cDNA sequence of B810P (19560.1), 
showing no significant homology to any known gene

SEQ ID NO:242 is the determined cDNA sequence of 19588.1, showing
homology to aortic carboxypeptidase-like protein

SEQ ID NO:243 is the determined cDNA sequence of 19551.1, showing 
homology to human BCL-1 gene 

SEQ ID NO:244 is the determined cDNA sequence of 19567.1, showing 
homology to human proteasome-related mRNA

SEQ ID NO:245 is the determined cDNA sequence of B803P (19583.1), 
showing no significant homology to any known gene 

SEQ ID NO:246 is the determined cDNA sequence of B812P (19587.1),
showing no significant homology to any known gene

SEQ ID NO:247 is the determined cDNA sequence of B802P (19392.2),
showing homology to human chromosome 17

SEQ ID NO:248 is the determined cDNA sequence of 19393.2, showing 
homology to human nicein B2 chain 

SEQ ID NO:249 is the determined cDNA sequence of 19398.2, human
MHC class II DQ alpha mRNA

SEQ ID NO:250 is the determined cDNA sequence of B804P (19399.2),
showing homology to human Xp22 BAC GSHB-184P14

SEQ ID NO:251 is the determined cDNA sequence of 19401.2, showing
homology to human ikB kinase-b gene

SEQ ID NO:252 is the determined cDNA sequence of 20266, showing
no significant homology to any known gene

SEQ ID NO:253 is the determined cDNA sequence of B826P (20270),
showing no significant homology to any known gene

SEQ ID NO:254 is the determined cDNA sequence of 20274, showing
no significant homology to any known gene

SEQ ID NO:255 is the determined cDNA sequence of 20276, showing
no significant homology to any known gene

SEQ ID NO:256 is the determined cDNA sequence of 20277, showing 
no significant homology to any known gene 

SEQ ID NO:257 is the determined cDNA sequence of B823P (20280), 
showing no significant homology to any known gene

SEQ ID NO:258 is the determined cDNA sequence of B821P (20281),
showing no significant homology to any known gene

SEQ ID NO:259 is the determined cDNA sequence of B824P (20294), 
showing no significant homology to any known gene 

SEQ ID NO:260 is the determined cDNA sequence of 20303, showing
no significant homology to any known gene

SEQ ID NO:261 is the determined cDNA sequence of B820P (20310),
showing no significant homology to any known gene

SEQ ID NO:262 is the determined cDNA sequence of B825P (20336),
showing no significant homology to any known gene

SEQ ID NO:263 is the determined cDNA sequence of B827P (20341),
showing no significant homology to any known gene

SEQ ID NO:264 is the determined cDNA sequence of 20941, showing 
no significant homology to any known gene 

SEQ ID NO:265 is the determined cDNA sequence of 20954, showing 
no significant homology to any known gene

SEQ ID NO:266 is the determined cDNA sequence of 20961, showing
no significant homology to any known gene

SEQ ID NO:267 is the determined cDNA sequence of 20965, showing
no significant homology to any known gene

SEQ ID NO:268 is the determined cDNA sequence of 20975, showing
no significant homology to any known gene

SEQ ID NO:269 is the determined cDNA sequence of 20261, showing
homology to Human p120 catenin

SEQ ID NO:270 is the determined cDNA sequence of B822P (20262), 
showing homology to Human membrane glycoprotein 4F2 

SEQ ID NO:271 is the determined cDNA sequence of 20265, showing 
homology to Human Na, K-ATPase Alpha 1

SEQ ID NO:272 is the determined cDNA sequence of 20267, showing
homology to Human heart HS 90, partial cds

SEQ ID NO:273 is the determined cDNA sequence of 20268, showing
homology to Human mRNA GPI-anchored protein p137

SEQ ID NO:274 is the determined cDNA sequence of 20271, showing
homology to Human cleavage stimulation factor 77 kDa subunit

SEQ ID NO:275 is the determined cDNA sequence of 20272, showing
homology to Human p190-B

SEQ ID NO:276 is the determined cDNA sequence of 20273, showing 
homology to Human ribophorin

SEQ ID NO:277 is the determined cDNA sequence of 20278, showing
homology to Human ornithine amino transferase

SEQ ID NO:278 is the determined cDNA sequence of 20279, showing
homology to Human S-adenosylmethionine synthetase

SEQ ID NO:279 is the determined cDNA sequence of 20293, showing
homology to Human x inactivation transcript

SEQ ID NO:280 is the determined cDNA sequence of 20300, showing 
homology to Human cytochrome p450 

SEQ ID NO:281 is the determined cDNA sequence of 20305, showing
homology to Human elongation factor-1 alpha

SEQ ID NO:282 is the determined cDNA sequence of 20306, showing
homology to Human epithelial ets protein

SEQ ID NO:283 is the determined cDNA sequence of 20307, showing
homology to Human signal transducer mRNA

SEQ ID NO:284 is the determined cDNA sequence of 20313, showing
homology to Human GABA-A receptor pi subunit mRNA

SEQ ID NO:285 is the determined cDNA sequence of 20317, showing
homology to Human tyrosine phosphatase

SEQ ID NO:286 is the determined cDNA sequence of 20318, showing 
homology to Human cathepsine B proteinase 

SEQ ID NO:287 is the determined cDNA sequence of 20320, showing 
homology to Human 2-phosphopyruvate-hydratase-alpha-enolase

SEQ ID NO:288 is the determined cDNA sequence of 20321, showing
homology to Human E-cadherin 

SEQ ID NO:289 is the determined cDNA sequence of 20322, showing 
homology to Human hsp86

SEQ ID NO:290 is the determined cDNA sequence of B828P (20326),
showing homology to Human x inactivation transcript

SEQ ID NO:291 is the determined cDNA sequence of 20333, showing
homology to Human chromatin regulator, SMARCA5

SEQ ID NO:292 is the determined cDNA sequence of 20335, showing
homology to Human sphingolipid activator protein 1

SEQ ID NO:293 is the determined cDNA sequence of 20337, showing
homology to Human hepatocyte growth factor activator inhibitor type 2 

SEQ ID NO:294 is the determined cDNA sequence of 20338, showing 
homology to Human cell adhesion molecule CD44

SEQ ID NO:295 is the determined cDNA sequence of 20340, showing
homology to Human nuclear factor (erythroid-derived)-like 1

SEQ ID NO:296 is the determined cDNA sequence of 20938, showing
homology to Human vinculin mRNA

SEQ ID NO:297 is the determined cDNA sequence of 20939, showing
homology to Human elongation factor EF-1-alpha

SEQ ID NO:298 is the determined cDNA sequence of 20940, showing
homology to Human nestin gene

SEQ ID NO:299 is the determined cDNA sequence of 20942, showing
homology to Human pancreatic ribonuclease

SEQ ID NO:300 is the determined cDNA sequence of 20943, showing
homology to Human transcobalamin I

SEQ ID NO:301 is the determined cDNA sequence of 20944, showing
homology to Human beta-tubulin

SEQ ID NO:302 is the determined cDNA sequence of 20946, showing 
homology to Human HS 1 protein 

SEQ ID NO:303 is the determined cDNA sequence of 20947, showing 
homology to Human cathepsin B

SEQ ID NO:304 is the determined cDNA sequence of 20948, showing
homology to Human testis enhanced gene transcript

SEQ ID NO:305 is the determined cDNA sequence of 20949, showing
homology to Human elongation factor EF-1-alpha

SEQ ID NO:306 is the determined cDNA sequence of 20950, showing
homology to Human ADP-ribosylation factor 3

SEQ ID NO:307 is the determined cDNA sequence of 20951, showing
homology to Human IFP53 or WRS for tryptophanyl-tRNA synthetase

SEQ ID NO:308 is the determined cDNA sequence of 20952, showing
homology to Human cyclin-dependent protein kinase

SEQ ID NO:309 is the determined cDNA sequence of 20957, showing 
homology to Human alpha-tubulin isoform 1 

SEQ ID NO:310 is the determined cDNA sequence of 20959, showing 
homology to Human tyrosine phosphatase-61bp deletion

SEQ ID NO:311 is the determined cDNA sequence of 20966, showing
homology to Human tyrosine phosphatase

SEQ ID NO:312 is the determined cDNA sequence of B830P (20976), 
showing homology to Human nuclear factor NF 45 

SEQ ID NO:313 is the determined cDNA sequence of B829P (20977), 
showing homology to Human delta-6 fatty acid desaturase

SEQ ID NO:314 is the determined cDNA sequence of 20978, showing 
homology to Human nuclear aconitase

SEQ ID NO:315 is the determined cDNA sequence of clone 23176.

SEQ ID NO:316 is the determined cDNA sequence of clone 23140.

SEQ ID NO:317 is the determined cDNA sequence of clone 23166.

SEQ ID NO:318 is the determined cDNA sequence of clone 23167.

SEQ ID NO:319 is the determined cDNA sequence of clone 23177.

SEQ ID NO:320 is the determined cDNA sequence of clone 23217.

SEQ ID NO:321 is the determined cDNA sequence of clone 23169.

SEQ ID NO:322 is the determined cDNA sequence of clone 23160.

SEQ ID NO:323 is the determined cDNA sequence of clone 23182.

SEQ ID NO:324 is the determined cDNA sequence of clone 23232.

SEQ ID NO:325 is the determined cDNA sequence of clone 23203.

SEQ ID NO:326 is the determined cDNA sequence of clone 23198.

SEQ ID NO:327 is the determined cDNA sequence of clone 23224.

SEQ ID NO:328 is the determined cDNA sequence of clone 23142.

SEQ ID NO:329 is the determined cDNA sequence of clone 23138.

SEQ ID NO:330 is the determined cDNA sequence of clone 23147.

SEQ ID NO:331 is the determined cDNA sequence of clone 23148.

SEQ ID NO:332 is the determined cDNA sequence of clone 23149.

SEQ ID NO:333 is the determined cDNA sequence of clone 23172.

SEQ ID NO:334 is the determined cDNA sequence of clone 23158.

SEQ ID NO:335 is the determined cDNA sequence of clone 23156.

SEQ ID NO:336 is the determined cDNA sequence of clone 23221.

SEQ ID NO:337 is the determined cDNA sequence of clone 23223.

SEQ ID NO:338 is the determined cDNA sequence of clone 23155.

SEQ ID NO:339 is the determined cDNA sequence of clone 23225.

SEQ ID NO:340 is the determined cDNA sequence of clone 23226.

SEQ ID NO:341 is the determined cDNA sequence of clone 23228.

SEQ ID NO:342 is the determined cDNA sequence of clone 23229.

SEQ ID NO:343 is the determined cDNA sequence of clone 23231.

SEQ ID NO:344 is the determined cDNA sequence of clone 23154.

SEQ ID NO:345 is the determined cDNA sequence of clone 23157.

SEQ ID NO:346 is the determined cDNA sequence of clone 23153.

SEQ ID NO:347 is the determined cDNA sequence of clone 23159.

SEQ ID NO:348 is the determined cDNA sequence of clone 23152.

SEQ ID NO:349 is the determined cDNA sequence of clone 23161.

SEQ ID NO:350 is the determined cDNA sequence of clone 23162.

SEQ ID NO:351 is the determined cDNA sequence of clone 23163.

SEQ ID NO:352 is the determined cDNA sequence of clone 23164.

SEQ ID NO:353 is the determined cDNA sequence of clone 23165.

SEQ ID NO:354 is the determined cDNA sequence of clone 23151.

SEQ ID NO:355 is the determined cDNA sequence of clone 23150.

SEQ ID NO:356 is the determined cDNA sequence of clone 23168.

SEQ ID NO:357 is the determined cDNA sequence of clone 23146.

SEQ ID NO:358 is the determined cDNA sequence of clone 23170.

SEQ ID NO:359 is the determined cDNA sequence of clone 23171.

SEQ ID NO:360 is the determined cDNA sequence of clone 23145.

SEQ ID NO:361 is the determined cDNA sequence of clone 23174.

SEQ ID NO:362 is the determined cDNA sequence of clone 23175.

SEQ ID NO:363 is the determined cDNA sequence of clone 23144.

SEQ ID NO:364 is the determined cDNA sequence of clone 23178.

SEQ ID NO:365 is the determined cDNA sequence of clone 23179.

SEQ ID NO:366 is the determined cDNA sequence of clone 23180.

SEQ ID NO:367 is the determined cDNA sequence of clone 23181.

SEQ ID NO:368 is the determined cDNA sequence of clone 23143.

SEQ ID NO:369 is the determined cDNA sequence of clone 23183.

SEQ ID NO:370 is the determined cDNA sequence of clone 23184.

SEQ ID NO:371 is the determined cDNA sequence of clone 23185.

SEQ ID NO:372 is the determined cDNA sequence of clone 23186.

SEQ ID NO:373 is the determined cDNA sequence of clone 23187.

SEQ ID NO:374 is the determined cDNA sequence of clone 23190.

SEQ ID NO:375 is the determined cDNA sequence of clone 23189.

SEQ ID NO:376 is the determined cDNA sequence of clone 23202.

SEQ ID NO:378 is the determined cDNA sequence of clone 23191.

SEQ ID NO:379 is the determined cDNA sequence of clone 23188.

SEQ ID NO:380 is the determined cDNA sequence of clone 23194.

SEQ ID NO:381 is the determined cDNA sequence of clone 23196.

SEQ ID NO:382 is the determined cDNA sequence of clone 23195.

SEQ ID NO:383 is the determined cDNA sequence of clone 23193.

SEQ ID NO:384 is the determined cDNA sequence of clone 23199.

SEQ ID NO:385 is the determined cDNA sequence of clone 23200.

SEQ ID NO:386 is the determined cDNA sequence of clone 23192.

SEQ ID NO:387 is the determined cDNA sequence of clone 23201.

SEQ ID NO:388 is the determined cDNA sequence of clone 23141.

SEQ ID NO:389 is the determined cDNA sequence of clone 23139.

SEQ ID NO:390 is the determined cDNA sequence of clone 23204.

SEQ ID NO:391 is the determined cDNA sequence of clone 23205.

SEQ ID NO:392 is the determined cDNA sequence of clone 23206.

SEQ ID NO:393 is the determined cDNA sequence of clone 23207.

SEQ ID NO:394 is the determined cDNA sequence of clone 23208.

SEQ ID NO:395 is the determined cDNA sequence of clone 23209.

SEQ ID NO:396 is the determined cDNA sequence of clone 23210.

SEQ ID NO:397 is the determined cDNA sequence of clone 23211.

SEQ ID NO:398 is the determined cDNA sequence of clone 23212.

SEQ ID NO:399 is the determined cDNA sequence of clone 23214.

SEQ ID NO:400 is the determined cDNA sequence of clone 23215.

SEQ ID NO:401 is the determined cDNA sequence of clone 23216.

SEQ ID NO:402 is the determined cDNA sequence of clone 23137.

SEQ ID NO:403 is the determined cDNA sequence of clone 23218.

SEQ ID NO:404 is the determined cDNA sequence of clone 23220.

SEQ ID NO:405 is the determined cDNA sequence of clone 19462.

SEQ ID NO:406 is the determined cDNA sequence of clone 19430.

SEQ ID NO:407 is the determined cDNA sequence of clone 19407.

SEQ ID NO:408 is the determined cDNA sequence of clone 19448.

SEQ ID NO:409 is the determined cDNA sequence of clone 19447.

SEQ ID NO:410 is the determined cDNA sequence of clone 19426.

SEQ ID NO:411 is the determined cDNA sequence of clone 19441.

SEQ ID NO:412 is the determined cDNA sequence of clone 19454.

SEQ ID NO:413 is the determined cDNA sequence of clone 19463.

SEQ ID NO:414 is the determined cDNA sequence of clone 19419.

SEQ ID NO:415 is the determined cDNA sequence of clone 19434.

SEQ ID NO:416 is the determined extended cDNA sequence of B820P.

SEQ ID NO:417 is the determined extended cDNA sequence of B821P.

SEQ ID NO:418 is the determined extended cDNA sequence of B822P. 

SEQ ID NO:419 is the determined extended cDNA sequence of B823P. 

SEQ ID NO:420 is the determined extended cDNA sequence of B824P.

SEQ ID NO:421 is the determined extended cDNA sequence of B825P.

SEQ ID NO:422 is the determined extended cDNA sequence of B826P. 

SEQ ID NO:423 is the determined extended cDNA sequence of B827P. 

SEQ ID NO:424 is the determined extended cDNA sequence of B828P. 

SEQ ID NO:425 is the determined extended cDNA sequence of B829P.

SEQ ID NO:426 is the determined extended cDNA sequence of B830P.

SEQ ID NO:427 is the determined cDNA sequence of clone 266B4. 

SEQ ID NO:428 is the determined cDNA sequence of clone 22892. 

SEQ ID NO:429 is the determined cDNA sequence of clone 266G3. 

SEQ ID NO:430 is the determined cDNA sequence of clone 22890.

SEQ ID NO:431 is the determined cDNA sequence of clone 264B4.

SEQ ID NO:432 is the determined cDNA sequence of clone 22883. 

SEQ ID NO:433 is the determined cDNA sequence of clone 22882. 

SEQ ID NO:434 is the determined cDNA sequence of clone 22880. 

SEQ ID NO:435 is the determined cDNA sequence of clone 263G1.

SEQ ID NO:436 is the determined cDNA sequence of clone 263G6.

SEQ ID NO:437 is the determined cDNA sequence of clone 262B2. 

SEQ ID NO:438 is the determined cDNA sequence of clone 262B6. 

SEQ ID NO:439 is the determined cDNA sequence of clone 22869. 

SEQ ID NO:440 is the determined cDNA sequence of clone 21374.

SEQ ID NO:441 is the determined cDNA sequence of clone 21362.

SEQ ID NO:442 is the determined cDNA sequence of clone 21349.

SEQ ID NO:443 is the determined cDNA sequence of clone 21309.

SEQ ID NO:444 is the determined cDNA sequence of clone 21097.

SEQ ID NO:445 is the determined cDNA sequence of clone 21096. 

SEQ ID NO:446 is the determined cDNA sequence of clone 21094. 

SEQ ID NO:447 is the determined cDNA sequence of clone 21093.

SEQ ID NO:448 is the determined cDNA sequence of clone 21091.

SEQ ID NO:449 is the determined cDNA sequence of clone 21089. 

SEQ ID NO:450 is the determined cDNA sequence of clone 21087. 

SEQ ID NO:451 is the determined cDNA sequence of clone 21085. 

SEQ ID NO:452 is the determined cDNA sequence of clone 21084.

SEQ ID NO:453 is a first partial cDNA sequence of clone 2BT1-40.

SEQ ID NO:454 is a second partial cDNA sequence of clone 2BT1-40. 

SEQ ID NO:455 is the determined cDNA sequence of clone 21063. 

SEQ ID NO:456 is the determined cDNA sequence of clone 21062. 

SEQ ID NO:457 is the determined cDNA sequence of clone 21060.

SEQ ID NO:458 is the determined cDNA sequence of clone 21053.

SEQ ID NO:459 is the determined cDNA sequence of clone 21050. 

SEQ ID NO:460 is the determined cDNA sequence of clone 21036. 

SEQ ID NO:461 is the determined cDNA sequence of clone 21037. 

SEQ ID NO:462 is the determined cDNA sequence of clone 21048.

SEQ ID NO:463 is a consensus DNA sequence of B726P (referred to as
B726P-spliced_seq_B726P).

SEQ ID NO:464 is the determined cDNA sequence of a second splice
form of B726P (referred to as 27490.seq_B726P).

SEQ ID NO:465 is the determined cDNA sequence of a third splice form
of B726P (referred to as 27068.seq_B726P).

SEQ ID NO:466 is the determined cDNA sequence of a second splice
form of B726P (referred to as 23113.seq_B726P).

SEQ ID NO:467 is the determined cDNA sequence of a second splice
form of B726P (referred to as 23103.seq_B726P).

SEQ ID NO:468 is the determined cDNA sequence of a second splice
form of B726P (referred to as 19310.seq_B726P).

SEQ ID NO:469 is the predicted amino acid sequence encoded by the
upstream ORF of SEQ ID NO:463.

SEQ ID NO:470 is the predicted amino acid sequence encoded by SEQ
ID NO:464.

SEQ ID NO:471 is the predicted amino acid sequence encoded by SEQ
ID NO:465.

SEQ ID NO:472 is the predicted amino acid sequence encoded by SEQ
ID NO:466.

SEQ ID NO:473 is the predicted amino acid sequence encoded by SEQ
ID NO:467.

SEQ ID NO:474 is the determined cDNA sequence for an alternative 
splice form of B726P. 

SEQ ID NO:475 is the amino acid sequence encoded by SEQ ID
NO:474.

SEQ ID NO:476 is the isolated cDNA sequence of B720P.

SEQ ID NO:477 is the cDNA sequence of a known keratin gene.

SEQ ID NO:478 is the amino acid sequence encoded by SEQ ID
NO:477.

SEQ ID NO:479 is the determined cDNA sequence for clone 19465.

SEQ ID NO:480 and 481 are PCR primers.

SEQ ID NO:482 is the cDNA sequence for the expressed downstream
ORF of B726P.

SEQ ID NO:483 is the amino acid sequence for the expressed 
recombinant downstream ORF of B726P.

SEQ ID NO:484 is the determined full-length cDNA sequence for
B720P.

SEQ ID NO:485 is the amino acid sequence encoded by SEQ ID
NO:484.

SEQ ID NO:486 is the determined cDNA sequence of a truncated form
of B720P, referred to as B720P-tr.

SEQ ID NO:487 is the amino acid sequence of B720P-tr.

SEQ ID NO:488 is the amino acid sequence of a naturally processed
epitope of B726P recognized by B726P-specific CTL.

SEQ ID NO:489 is a DNA sequence encoding the B726P epitope set
forth in SEQ ID NO:488.

SEQ ID NO:490 is a DNA sequence encoding a fusion protein wherein
mammaglobin is fused to the B726P combined upstream and downstream open reading
frame (ORF) (the amino acid sequence of the B726P combined ORF is disclosed herein
as SEQ ID NO:475 which is encoded by the DNA sequence of SEQ ID NO:474).

SEQ ID NO:491 is a DNA sequence encoding a fusion protein wherein
mammaglobin is fused to the B726P upstream ORF (the amino acid sequence of the
B726P upstream ORF is disclosed herein as SEQ ID NO:469 which is encoded by the
DNA sequence of SEQ ID NO:463).

SEQ ID NO:492 is a DNA sequence encoding a fusion protein wherein 
mammaglobin is fused to the B726P downstream ORF (the amino acid sequence of the 
B726P downstream ORF is disclosed herein as SEQ ID NO:176 which is encoded by
the DNA sequence of SEQ ID NO:175).

SEQ ID NO:493 is the amino acid sequence encoded by the DNA 
sequence of SEQ ID NO:490. 

SEQ ID NO:494 is the amino acid sequence encoded by the DNA 
sequence of SEQ ID NO:491.

SEQ ID NO:495 is the amino acid sequence encoded by the DNA 
sequence of SEQ ID NO:492. 

SEQ ID NO:496 is amino acids 59-78 of SEQ ID NO:493. 

SEQ ID NO:497 is amino acids 55-69 of SEQ ID NO:493.

SEQ ID NO:498 is amino acids 13-33 of SEQ ID NO:493.

SEQ ID NO:499 is amino acids 41-60 of SEQ ID NO:493. 

SEQ ID NO:500 is amino acids 2-10 of SEQ ID NO:493. 

SEQ ID NO:501 is amino acids 47-59 of SEQ ID NO:493. 

SEQ ID NO:502 is amino acids 62-74 of SEQ ID NO:493.

SEQ ID NO:503 is amino acids 1-93 of SEQ ID NO:493.

SEQ ID NO:504 is the full-length cDNA sequence for B718P.

SEQ ID NO:505 is the cDNA sequence of the open reading frame of
B718P including stop codon.

SEQ ID NO:506 is the cDNA sequence of the open reading frame of
B718P without stop codon.

SEQ ID NO:507 is the full-length amino acid sequence of B718P.

SEQ ID NO:508 represents amino acids 1-158 of SEQ ID NO:507.

SEQ ID NO:509 represents amino acids 159-243 of SEQ ID NO:507.

SEQ ID NO:510 is the entire cDNA sequence of the open reading frame,
including stop codon, of a first variant of B723P, referred to as B723P-short.

SEQ ID NO:511 is the entire cDNA sequence of the open reading frame,
without stop codon, of a first variant of B723P, referred to as B723P-short.

SEQ ID NO:512 is the entire cDNA sequence of the open reading frame, 
including stop codon, of a second variant of B723P, referred to as B723P-long.

SEQ ID NO:513 is the entire cDNA sequence of the open reading frame, 
without stop codon, of a second variant of B723P, referred to as B723P-long.

SEQ ID NO:514 is the amino acid sequence of B723P-short.

SEQ ID NO:515 is the amino acid sequence of B723P-long.

SEQ ID NO:516 is amino acids 1-197 of B723P-short.

SEQ ID NO:517 is amino acids 1-232 of B723P-long.

SEQ ID NO:518 is amino acids 198-243 of B723P-short.

SEQ ID NO:519 is amino acids 218-243 of B723P-short.

SEQ ID NO:520-533 are the DNA sequences of epitopes of B726P.

SEQ ID NO:534-547 are the amino acid sequences of epitopes of B726P.

SEQ ID NO:548 is the cDNA sequence of B726P Combined ORF
coding region for expression in E. coli.

SEQ ID NO:549 is the cDNA sequence of B726P Upstream ORF
coding region for expression in E. coli.

SEQ ID NO:550 is the cDNA sequence of B726P Downstream ORF
coding region for expression in E. coli.
SEQ ID NO:551 is the amino acid sequence of B726P Downstream ORF
encoded by the cDNA set forth in SEQ ID NO:550.

SEQ ID NO:552 is the amino acid sequence of B726P Upstream ORF
with HIS, encoded by the cDNA set forth in SEQ ID NO:549.

SEQ ID NO:553 is the amino acid sequence of B726P Combined ORF
correct, encoded by the cDNA set forth in SEQ ID NO:548.

SEQ ID NO:554-563 are PCR primers as described in Example 8.

SEQ ID NO:564 is the cDNA sequence for NY-BR-1, an extended
sequence of B726P.

SEQ ID NO:565 is the amino acid sequence for NY-BR-1, an extended
sequence of B726P, and encoded by the nucleotide sequence set forth in SEQ ID
NO:564.

SEQ ID NO:566 is the cDNA sequence for B726P XC coding region
with changes.

SEQ ID NO:567 is the cDNA sequence for B726P XB clone 83686 with
2 changes from the published NY-BR-1 sequence in SEQ ID NO:564.

SEQ ID NO:568 is the cDNA sequence for B726P XB clone 84330 with
4 changes from the published NY-BR-1 sequence in SEQ ID NO:564.

SEQ ID NO:569 is the cDNA sequence for B726P XB clone 84328 with
3 changes from the published NY-BR-1 sequence in SEQ ID NO:564.

SEQ ID NO:570 is the amino acid sequence for B726P XB clone 84328,
encoded by the sequence set forth in SEQ ID NO:569.

SEQ ID NO:571 is the amino acid sequence for B726P XB clone 84330,
encoded by the sequence set forth in SEQ ID NO:568.

SEQ ID NO:572 is the amino acid sequence for B726P XB clone 83686,
encoded by the sequence set forth in SEQ ID NO:567.

SEQ ID NO:573 is the amino acid sequence for B726P XC, encoded by
the sequence set forth in SEQ ID NO:566.

SEQ ID NO:574-575 are PCR primers as described in Example 12.

SEQ ID NO:576 is the full-length cDNA sequence for NY-BR-1.1.
SEQ ID NO:577 is the full-length amino acid sequence for NY-BR-1.1,
encoded by the nucleotide sequence set forth in SEQ ID NO:576.

SEQ ID NO:578 is amino acids 289-308 of the B726P downstream ORF
and corresponds to the peptide recognized by the 220A2.1 antibody.

SEQ ID NO:579 is amino acids 225-244 of the B726P downstream ORF
and corresponds to the peptide recognized by the 220A19.1 antibody.

SEQ ID NO:580 is amino acids 232-252 of the B726P downstream ORF
and corresponds to the peptide recognized by the 220A19.1 and the 220A43 antibodies.

SEQ ID NO:581 is amino acids 73-92 of the B726P downstream ORF
and corresponds to the peptide recognized by the 220A94.1 antibody.

SEQ ID NO:582 is amino acids 145-164 of the B726P downstream ORF
and corresponds to the peptide recognized by the 220A151.1 and 220A86 antibodies.

SEQ ID NO:583 is amino acids 153-172 of the B726P downstream ORF
and corresponds to the peptide recognized by the 220A151.1 and 220A86 antibodies.

SEQ ID NO:584 is amino acids 1-20 of the B726P downstream ORF and
corresponds to the peptide recognized by purified B726 polyclonal antibodies.

SEQ ID NO:585 is amino acids 9-28 of the B726P downstream ORF and
corresponds to the peptide recognized by purified B726 polyclonal antibodies.

SEQ ID NO:586 is amino acids 17-36 of the B726P downstream ORF
and corresponds to the peptide recognized by purified B726 polyclonal antibodies.

SEQ ID NO:587 is amino acids 24-44 of the B726P downstream ORF
and corresponds to the peptide recognized by purified B726 polyclonal antibodies.

SEQ ID NO:588 is amino acids 97-116 of the B726P downstream ORF
and corresponds to the peptide recognized by purified B726 polyclonal antibodies.

SEQ ID NO:589 is amino acids 105-124 of the B726P downstream ORF
and corresponds to the peptide recognized by purified B726 polyclonal antibodies.

SEQ ID NO:590 is amino acids 113-132 of the B726P downstream ORF
and corresponds to the peptide recognized by purified B726 polyclonal antibodies.

SEQ ID NO:591 is amino acids 121-140 of the B726P downstream ORF
and corresponds to the peptide recognized by purified B726 polyclonal antibodies.

SEQ ID NO:592 is ammo acids 129-148 of the B726P downstream ORF 
30 and corresponds to the peptide recognized by purified B726 polyclonal antibodies. 
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SEQ ID NO:593 is amino acids 137456 of the B726P downstream ORP 
and corresponds to the peptide recognized by purified B726 polyclonal antibodies. 

SEQ ID NO:594 is the amino acid sequence of peptide #2732 and
corresponds to amino acids 1-20 of the B726P downstream ORF.

SEQ ID NO:595 is the amino acid sequence of peptide #2733 and
corresponds to amino acids 11-30 of the B726P downstream ORF.

SEQ ID NO:596 is the amino acid sequence of peptide #2734 and
corresponds to amino acids 21-40 of the B726P downstream ORF.

SEQ ID NO:597 is the amino acid sequence of peptide #2735 and
corresponds to amino acids 31-50 of the B726P downstream ORF.

SEQ ID NO:598 is the amino acid sequence of peptide #2736 and
corresponds to amino acids 41-60 of the B726P downstream ORF.

SEQ ID NO:599 is the amino acid sequence of peptide #2737 and
corresponds to amino acids 51-70 of the B726P downstream ORF.

SEQ ID NO:600 is the amino acid sequence of peptide #2738 and
corresponds to amino acids 61-80 of the B726P downstream ORF.

SEQ ID NO:601 is the amino acid sequence of peptide #2739 and
corresponds to amino acids 71-90 of the B726P downstream ORF.

SEQ ID NO:602 is the amino acid sequence of peptide #2740 and
corresponds to amino acids 81-100 of the B726P downstream ORF.

SEQ ID NO:603 is the amino acid sequence of peptide #2741 and
corresponds to amino acids 91-110 of the B726P downstream ORF.

SEQ ID NO:604 is the amino acid sequence of peptide #2742 and
corresponds to amino acids 101-120 of the B726P downstream ORF.

SEQ ID NO:605 is the amino acid sequence of peptide #2743 and
corresponds to amino acids 111-130 of the B726P downstream ORF.

SEQ ID NO:606 is the amino acid sequence of peptide #2744 and
corresponds to amino acids 121-140 of the B726P downstream ORF.

SEQ ID NO:607 is the amino acid sequence of peptide #2745 and
corresponds to amino acids 130-151 of the B726P downstream ORF.

SEQ ID NO:608 is the amino acid sequence of peptide #2746 and
corresponds to amino acids 141-160 of the B726P downstream ORF.

SEQ ID NO:609 is the amino acid sequence of peptide #2747 and
corresponds to amino acids 151-170 of the B726P downstream ORF.

SEQ ID NO:610 is the amino acid sequence of peptide #2748 and
corresponds to amino acids 161-180 of the B726P downstream ORF.

SEQ ID NO:611 is the amino acid sequence of peptide #2749 and
corresponds to amino acids 170-190 of the B726P downstream ORF.

SEQ ID NO:612 is the amino acid sequence of peptide #2750 and
corresponds to amino acids 181-200 of the B726P downstream ORF.

SEQ ID NO:613 is the amino acid sequence of peptide #2751 and
corresponds to amino acids 191-210 of the B726P downstream ORF.

SEQ ID NO:614 is the amino acid sequence of peptide #2752 and
corresponds to amino acids 201-220 of the B726P downstream ORF.

SEQ ID NO:615 is the amino acid sequence of peptide #2753 and
corresponds to amino acids 211-230 of the B726P downstream ORF.

SEQ ID NO:616 is the amino acid sequence of peptide #2765 and
corresponds to amino acids 221-240 of the B726P downstream ORF.

SEQ ID NO:617 is the amino acid sequence of peptide #2766 and
corresponds to amino acids 231-250 of the B726P downstream ORF.

SEQ ID NO:618 is the amino acid sequence of peptide #2767 and
corresponds to amino acids 240-260 of the B726P downstream ORF.

SEQ ID NO:619 is the amino acid sequence of peptide #2768 and
corresponds to amino acids 251-270 of the B726P downstream ORF.

SEQ ID NO:620 is the amino acid sequence of peptide #2769 and
corresponds to amino acids 261-280 of the B726P downstream ORF.

SEQ ID NO:621 is the amino acid sequence of peptide #2770 and
corresponds to amino acids 271-290 of the B726P downstream ORF.

SEQ ID NO:622 is the amino acid sequence of peptide #2771 and
corresponds to amino acids 281-300 of the B726P downstream ORF.

SEQ ID NO:623 is the amino acid sequence of peptide #2772 and
corresponds to amino acids 291-310 of the B726P downstream ORF.

SEQ ID NO:624 is the amino acid sequence of peptide #2773 and
corresponds to amino acids 298-317 of the B726P downstream ORF.

SEQ ID NO:625 is the amino acid sequence of peptide #3535 of B726P.

SEQ ID NO:626 is the amino acid sequence of peptide #3536 of B726P. 

SEQ ID NO:627 is the amino acid sequence of peptide #3534 of B726P. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed generally to compositions and their use
in the therapy and diagnosis of cancer, particularly breast cancer. As described further
below, illustrative compositions of the present invention include, but are not restricted
to, polypeptides, particularly immunogenic polypeptides, polynucleotides encoding such
polypeptides, antibodies and other binding agents, antigen presenting cells (APCs) and
immune system cells (e.g., T cells).

The practice of the present invention will employ, unless indicated
specifically to the contrary, conventional methods of virology, immunology,
microbiology, molecular biology and recombinant DNA techniques within the skill of
the art, many of which are described below for the purpose of illustration. Such
techniques are explained fully in the literature. See, e.g., Sambrook, et al. Molecular
Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning:
A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & II (D.
Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); Nucleic Acid
Hybridization (B. Hames & S. Higgins, eds., 1985); Transcription and Translation (B.
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed., 1986); Perbal,
A Practical Guide to Molecular Cloning (1984).

All publications, patents and patent applications cited herein, whether
supra or infra, are hereby incorporated by reference in their entirety.

As used in this specification and the appended claims, the singular forms
"a," "an" and "the" include plural references unless the content clearly dictates
otherwise.

Polypeptide Compositions

As used herein, the term "polypeptide" is used in its conventional
meaning, i.e., as a sequence of amino acids. The polypeptides are not limited to a
specific length of the product; thus, peptides, oligopeptides, and proteins are included
within the definition of polypeptide, and such terms may be used interchangeably herein
unless specifically indicated otherwise. This term also does not refer to or exclude
post-expression modifications of the polypeptide, for example, glycosylations, acetylations,
phosphorylations and the like, as well as other modifications known in the art, both
naturally occurring and non-naturally occurring. A polypeptide may be an entire
protein, or a subsequence thereof. Particular polypeptides of interest in the context of
this invention are amino acid subsequences comprising epitopes, i.e., antigenic
determinants substantially responsible for the immunogenic properties of a polypeptide
and being capable of evoking an immune response.

Particularly illustrative polypeptides of the present invention comprise
those encoded by a polynucleotide sequence set forth in any one of SEQ ID NO: 1-61,
63-175, 178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486, 489-492, 504-506,
510-513, 520-533, 548-550, 564, 566-569, and 576, or a sequence that hybridizes under
moderately stringent conditions, or, alternatively, under highly stringent conditions, to a
polynucleotide sequence set forth in any one of SEQ ID NO: 1-61, 63-175, 178, 180,
182-468, 474, 476, 477, 479, 482, 484, 486, 489-492, 504-506, 510-513, 520-533,
548-550, 564, 566-569, and 576. Certain other illustrative polypeptides of the invention
comprise amino acid sequences as set forth in any one of SEQ ID NO: 62, 176, 179,
181, 469-473, 475, 478, 483, 485, 487, 488, 493-503, 507-509, 514-519, 534-547,
551-553, 565, 570-573, and 577-627.

The polypeptides of the present invention are sometimes herein referred
to as breast tumor proteins or breast tumor polypeptides, as an indication that their
identification has been based at least in part upon their increased levels of expression in
breast tumor samples. Thus, a "breast tumor polypeptide" or "breast tumor protein,"
refers generally to a polypeptide sequence of the present invention, or a polynucleotide
sequence encoding such a polypeptide, that is expressed in a substantial proportion of
breast tumor samples, for example preferably greater than about 20%, more preferably
greater than about 30%, and most preferably greater than about 50% or more of breast
tumor samples tested, at a level that is at least two fold, and preferably at least five fold,
greater than the level of expression in normal tissues, as determined using a
representative assay provided herein. A breast tumor polypeptide sequence of the
invention, based upon its increased level of expression in tumor cells, has particular
utility both as a diagnostic marker as well as a therapeutic target, as further described
below.
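The expression criterion in the preceding paragraph can be illustrated with a minimal sketch. The function name, its parameters and the use of a single normal-tissue reference level are assumptions made for illustration only; the specification does not prescribe any particular computation or assay readout.

```python
def is_breast_tumor_candidate(tumor_levels, normal_level,
                              min_fold=2.0, min_fraction=0.20):
    """Return True if enough tumor samples over-express the sequence.

    tumor_levels : expression measurements, one per tumor sample tested
    normal_level : reference expression level in normal tissue (must be > 0)
    min_fold     : required tumor/normal ratio (text: at least two fold)
    min_fraction : required share of samples (text: greater than about 20%)
    """
    if not tumor_levels or normal_level <= 0:
        raise ValueError("need at least one sample and a positive normal level")
    # Count samples whose level is at least min_fold times the normal level.
    hits = sum(1 for level in tumor_levels if level >= min_fold * normal_level)
    # The share of over-expressing samples must exceed the chosen fraction.
    return hits / len(tumor_levels) > min_fraction
```

Passing min_fold=5.0 or min_fraction=0.50 would correspond to the more preferred thresholds recited above.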

In certain preferred embodiments, the polypeptides of the invention are
immunogenic, i.e., they react detectably within an immunoassay (such as an ELISA or
T-cell stimulation assay) with antisera and/or T-cells from a patient with breast cancer.
Screening for immunogenic activity can be performed using techniques well known to
the skilled artisan. For example, such screens can be performed using methods such as
those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring
Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be
immobilized on a solid support and contacted with patient sera to allow binding of
antibodies within the sera to the immobilized polypeptide. Unbound sera may then be
removed and bound antibodies detected using, for example, 125I-labeled Protein A.

As would be recognized by the skilled artisan, immunogenic portions of
the polypeptides disclosed herein are also encompassed by the present invention. An
"immunogenic portion," as used herein, is a fragment of an immunogenic polypeptide
of the invention that itself is immunologically reactive (i.e., specifically binds) with the
B-cells and/or T-cell surface antigen receptors that recognize the polypeptide.
Immunogenic portions may generally be identified using well known techniques, such
as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press,
1993) and references cited therein. Such techniques include screening polypeptides for
the ability to react with antigen-specific antibodies, antisera and/or T-cell lines or
clones. As used herein, antisera and antibodies are "antigen-specific" if they
specifically bind to an antigen (i.e., they react with the protein in an ELISA or other
immunoassay, and do not react detectably with unrelated proteins). Such antisera and
antibodies may be prepared as described herein, and using well-known techniques.

In one preferred embodiment, an immunogenic portion of a polypeptide
of the present invention is a portion that reacts with antisera and/or T-cells at a level that
is not substantially less than the reactivity of the full-length polypeptide (e.g., in an
ELISA and/or T-cell reactivity assay). Preferably, the level of immunogenic activity of
the immunogenic portion is at least about 50%, preferably at least about 70% and most
preferably greater than about 90% of the immunogenicity for the full-length
polypeptide. In some instances, preferred immunogenic portions will be identified that
have a level of immunogenic activity greater than that of the corresponding full-length
polypeptide, e.g., having greater than about 100% or 150% or more immunogenic
activity.

In certain other embodiments, illustrative immunogenic portions may
include peptides in which an N-terminal leader sequence and/or transmembrane domain
have been deleted. Other illustrative immunogenic portions will contain a small N-
and/or C-terminal deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids),
relative to the mature protein.

In another embodiment, a polypeptide composition of the invention may
also comprise one or more polypeptides that are immunologically reactive with T cells
and/or antibodies generated against a polypeptide of the invention, particularly a
polypeptide having an amino acid sequence disclosed herein, or to an immunogenic
fragment or variant thereof.

In another embodiment of the invention, polypeptides are provided that
comprise one or more polypeptides that are capable of eliciting T cells and/or antibodies
that are immunologically reactive with one or more polypeptides described herein, or
one or more polypeptides encoded by contiguous nucleic acid sequences contained in
the polynucleotide sequences disclosed herein, or immunogenic fragments or variants
thereof, or to one or more nucleic acid sequences which hybridize to one or more of
these sequences under conditions of moderate to high stringency.

The present invention, in another aspect, provides polypeptide fragments
comprising at least about 5, 10, 15, 20, 25, 50, or 100 contiguous amino acids, or more,
including all intermediate lengths, of a polypeptide composition set forth herein, such
as those set forth in SEQ ID NO: 62, 176, 179, 181, 469-473, 475, 478, 483, 485, 487,
488, 493-503, 507-509, 514-519, 534-547, 551-553, 565, 570-573, and 577-627, or
those encoded by a polynucleotide sequence set forth in a sequence of SEQ ID NO: 1-61,
63-175, 178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486, 489-492, 504-506,
510-513, 520-533, 548-550, 564, 566-569, and 576.

In another aspect, the present invention provides variants of the
polypeptide compositions described herein. Polypeptide variants generally
encompassed by the present invention will typically exhibit at least about 70%, 75%,
80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more identity
(determined as described below), along their length, to a polypeptide sequence set forth
herein.

In one preferred embodiment, the polypeptide fragments and variants
provided by the present invention are immunologically reactive with an antibody and/or
T-cell that reacts with a full-length polypeptide specifically set forth herein.

In another preferred embodiment, the polypeptide fragments and variants
provided by the present invention exhibit a level of immunogenic activity of at least
about 50%, preferably at least about 70%, and most preferably at least about 90% or
more of that exhibited by a full-length polypeptide sequence specifically set forth
herein.

A polypeptide "variant," as the term is used herein, is a polypeptide that
typically differs from a polypeptide specifically disclosed herein in one or more
substitutions, deletions, additions and/or insertions. Such variants may be naturally
occurring or may be synthetically generated, for example, by modifying one or more of
the above polypeptide sequences of the invention and evaluating their immunogenic
activity as described herein and/or using any of a number of techniques well known in
the art.

For example, certain illustrative variants of the polypeptides of the
invention include those in which one or more portions, such as an N-terminal leader
sequence or transmembrane domain, have been removed. Other illustrative variants
include variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino
acids) has been removed from the N- and/or C-terminal of the mature protein.

In many instances, a variant will contain conservative substitutions. A
"conservative substitution" is one in which an amino acid is substituted for another
amino acid that has similar properties, such that one skilled in the art of peptide
chemistry would expect the secondary structure and hydropathic nature of the
polypeptide to be substantially unchanged. As described above, modifications may be
made in the structure of the polynucleotides and polypeptides of the present invention
and still obtain a functional molecule that encodes a variant or derivative polypeptide
with desirable characteristics, e.g., with immunogenic characteristics. When it is
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or
even an improved, immunogenic variant or portion of a polypeptide of the invention,
one skilled in the art will typically change one or more of the codons of the encoding
DNA sequence according to Table 1.

For example, certain amino acids may be substituted for other amino
acids in a protein structure without appreciable loss of interactive binding capacity with
structures such as, for example, antigen-binding regions of antibodies or binding sites
on substrate molecules. Since it is the interactive capacity and nature of a protein that
defines that protein's biological functional activity, certain amino acid sequence
substitutions can be made in a protein sequence, and, of course, its underlying DNA
coding sequence, and nevertheless obtain a protein with like properties. It is thus
contemplated that various changes may be made in the peptide sequences of the
disclosed compositions, or corresponding DNA sequences which encode said peptides
without appreciable loss of their biological utility or activity.

Table 1

Amino Acids                   Codons

Alanine         Ala   A       GCA  GCC  GCG  GCU
Cysteine        Cys   C       UGC  UGU
Aspartic acid   Asp   D       GAC  GAU
Glutamic acid   Glu   E       GAA  GAG
Phenylalanine   Phe   F       UUC  UUU
Glycine         Gly   G       GGA  GGC  GGG  GGU
Histidine       His   H       CAC  CAU
Isoleucine      Ile   I       AUA  AUC  AUU
Lysine          Lys   K       AAA  AAG
Leucine         Leu   L       UUA  UUG  CUA  CUC  CUG  CUU
Methionine      Met   M       AUG
Asparagine      Asn   N       AAC  AAU
Proline         Pro   P       CCA  CCC  CCG  CCU
Glutamine       Gln   Q       CAA  CAG
Arginine        Arg   R       AGA  AGG  CGA  CGC  CGG  CGU
Serine          Ser   S       AGC  AGU  UCA  UCC  UCG  UCU
Threonine       Thr   T       ACA  ACC  ACG  ACU
Valine          Val   V       GUA  GUC  GUG  GUU
Tryptophan      Trp   W       UGG
Tyrosine        Tyr   Y       UAC  UAU
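As an illustration of how Table 1 may be used to change a codon of an encoding sequence, the following is a minimal sketch. The dictionary reproduces the codons of Table 1; the function names and the choice of the first listed codon for a substitution are assumptions made for illustration and are not taken from the specification. Stop codons are not part of Table 1 and are therefore not handled.

```python
# Codons of Table 1, keyed by one-letter amino acid code (RNA alphabet).
CODONS = {
    "A": ["GCA", "GCC", "GCG", "GCU"], "C": ["UGC", "UGU"],
    "D": ["GAC", "GAU"], "E": ["GAA", "GAG"], "F": ["UUC", "UUU"],
    "G": ["GGA", "GGC", "GGG", "GGU"], "H": ["CAC", "CAU"],
    "I": ["AUA", "AUC", "AUU"], "K": ["AAA", "AAG"],
    "L": ["UUA", "UUG", "CUA", "CUC", "CUG", "CUU"], "M": ["AUG"],
    "N": ["AAC", "AAU"], "P": ["CCA", "CCC", "CCG", "CCU"],
    "Q": ["CAA", "CAG"],
    "R": ["AGA", "AGG", "CGA", "CGC", "CGG", "CGU"],
    "S": ["AGC", "AGU", "UCA", "UCC", "UCG", "UCU"],
    "T": ["ACA", "ACC", "ACG", "ACU"], "V": ["GUA", "GUC", "GUG", "GUU"],
    "W": ["UGG"], "Y": ["UAC", "UAU"],
}
# Reverse lookup: codon -> amino acid.
CODON_TO_AA = {codon: aa for aa, codons in CODONS.items() for codon in codons}


def translate(seq):
    """Translate a coding sequence (DNA or RNA letters) into one-letter codes."""
    rna = seq.upper().replace("T", "U")
    if len(rna) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TO_AA[rna[i:i + 3]] for i in range(0, len(rna), 3))


def substitute_residue(seq, position, new_aa):
    """Return the RNA sequence with the codon at 1-based residue `position`
    replaced by a codon for `new_aa` (illustrative choice: first in Table 1)."""
    rna = seq.upper().replace("T", "U")
    start = 3 * (position - 1)
    if start < 0 or start + 3 > len(rna):
        raise ValueError("position outside the coding sequence")
    return rna[:start] + CODONS[new_aa][0] + rna[start + 3:]
```

For instance, substitute_residue("AUGGCUUGG", 2, "V") replaces the alanine codon with a valine codon, so that the sequence then translates to Met-Val-Trp.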











In making such changes, the hydropathic index of amino acids may be
considered. The importance of the hydropathic amino acid index in conferring
interactive biologic function on a protein is generally understood in the art (Kyte and
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative
hydropathic character of the amino acid contributes to the secondary structure of the
resultant protein, which in turn defines the interaction of the protein with other
molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and
the like. Each amino acid has been assigned a hydropathic index on the basis of its
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are:
isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine
(+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8);
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5);
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5).

It is known in the art that certain amino acids may be substituted by other
amino acids having a similar hydropathic index or score and still result in a protein with
similar biological activity, i.e., still obtain a biological functionally equivalent protein.
In making such changes, the substitution of amino acids whose hydropathic indices are
within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5
are even more particularly preferred. It is also understood in the art that the substitution
of like amino acids can be made effectively on the basis of hydrophilicity. U.S. Patent
4,554,101 (specifically incorporated herein by reference in its entirety), states that the
greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of
its adjacent amino acids, correlates with a biological property of the protein.
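The hydropathic index comparison described above can be sketched as follows. The values are those listed in the text; the function name and the convention of returning the tightest satisfied tolerance are assumptions made for illustration.

```python
# Hydropathic indices as listed above (Kyte and Doolittle, 1982).
HYDROPATHY = {
    "I": 4.5, "V": 4.2, "L": 3.8, "F": 2.8, "C": 2.5, "M": 1.9, "A": 1.8,
    "G": -0.4, "T": -0.7, "S": -0.8, "W": -0.9, "Y": -1.3, "P": -1.6,
    "H": -3.2, "E": -3.5, "Q": -3.5, "D": -3.5, "N": -3.5, "K": -3.9,
    "R": -4.5,
}


def hydropathy_tier(old_aa, new_aa):
    """Return the tightest tolerance (0.5, 1.0 or 2.0) within which the two
    residues' hydropathic indices agree, or None if they differ by more
    than 2."""
    # Round to one decimal to avoid floating-point noise at the boundaries.
    diff = round(abs(HYDROPATHY[old_aa] - HYDROPATHY[new_aa]), 1)
    for tolerance in (0.5, 1.0, 2.0):
        if diff <= tolerance:
            return tolerance
    return None
```

Under this sketch, isoleucine to valine falls in the ±0.5 tier, isoleucine to leucine in the ±1 tier, and isoleucine to alanine falls outside all three tiers.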

As detailed in U.S. Patent 4,554,101, the following hydrophilicity values
have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate
(+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2);
glycine (0); threonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine
(-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine
(-2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be
substituted for another having a similar hydrophilicity value and still obtain a
biologically equivalent, and in particular, an immunologically equivalent protein. In
such changes, the substitution of amino acids whose hydrophilicity values are within ±2
is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even
more particularly preferred.
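The notion of a greatest local average hydrophilicity may be illustrated by a sliding-window average over the values listed above. This is a sketch only: the window length of six residues is an illustrative assumption, the ± 1 ranges given for aspartate, glutamate and proline are ignored, and the function name is hypothetical.

```python
# Hydrophilicity values as listed above (central values only).
HYDROPHILICITY = {
    "R": 3.0, "K": 3.0, "D": 3.0, "E": 3.0, "S": 0.3, "N": 0.2, "Q": 0.2,
    "G": 0.0, "T": -0.4, "P": -0.5, "A": -0.5, "H": -0.5, "C": -1.0,
    "M": -1.3, "V": -1.5, "L": -1.8, "I": -1.8, "Y": -2.3, "F": -2.5,
    "W": -3.4,
}


def most_hydrophilic_window(sequence, window=6):
    """Return (start, average) for the window with the greatest average
    hydrophilicity; `start` is a 0-based index into `sequence`.
    When several windows tie, the first one is reported."""
    if window < 1 or len(sequence) < window:
        raise ValueError("sequence is shorter than the window")
    best_start, best_avg = 0, None
    for start in range(len(sequence) - window + 1):
        segment = sequence[start:start + window]
        avg = sum(HYDROPHILICITY[res] for res in segment) / window
        if best_avg is None or avg > best_avg:
            best_start, best_avg = start, avg
    return best_start, best_avg
```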

As outlined above, amino acid substitutions are generally therefore based
on the relative similarity of the amino acid side-chain substituents, for example, their
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that
take various of the foregoing characteristics into consideration are well known to those
of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and
threonine; glutamine and asparagine; and valine, leucine and isoleucine.

In addition, any polynucleotide may be further modified to increase
stability in vivo. Possible modifications include, but are not limited to, the addition of
flanking sequences at the 5' and/or 3' ends; the use of phosphorothioate or 2' O-methyl
rather than phosphodiester linkages in the backbone; and/or the inclusion of
nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl-,
methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and
uridine.

Amino acid substitutions may further be made on the basis of similarity 
in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic 
nature of the residues. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids with uncharged polar head groups having similar hydrophilicity values 
include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may 
represent conservative changes include: (1) ala, pro, gly, glu, asp, gln, asn, ser, thr;
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, 
his. A variant may also, or alternatively, contain nonconservative changes. In a 
preferred embodiment, variant polypeptides differ from a native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 
alternatively) be modified by, for example, the deletion or addition of amino acids that 
have minimal influence on the immunogenicity, secondary structure and hydropathic 
nature of the polypeptide. 
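The five groups of amino acids listed above can be used to sort the differences between a native sequence and a variant into conservative and nonconservative changes. The following is a minimal sketch under stated assumptions: both sequences have the same length (deletions and additions are not modelled), and the function names are hypothetical.

```python
# Groups (1)-(5) from the text, in one-letter code.
CONSERVATIVE_GROUPS = [
    set("APGEDQNST"),   # (1) ala, pro, gly, glu, asp, gln, asn, ser, thr
    set("CSYT"),        # (2) cys, ser, tyr, thr
    set("VILMAF"),      # (3) val, ile, leu, met, ala, phe
    set("KRH"),         # (4) lys, arg, his
    set("FYWH"),        # (5) phe, tyr, trp, his
]


def is_conservative(old_aa, new_aa):
    """True if both residues appear together in at least one group."""
    return any(old_aa in g and new_aa in g for g in CONSERVATIVE_GROUPS)


def classify_substitutions(native, variant):
    """Return (conservative, nonconservative) lists of
    (1-based position, native residue, variant residue)."""
    if len(native) != len(variant):
        raise ValueError("this sketch handles substitutions only")
    conservative, nonconservative = [], []
    for pos, (old, new) in enumerate(zip(native, variant), start=1):
        if old == new:
            continue
        target = conservative if is_conservative(old, new) else nonconservative
        target.append((pos, old, new))
    return conservative, nonconservative
```

The total number of entries in the two lists can then be compared with the limit of five changes recited for the preferred embodiment.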

As noted above, polypeptides may comprise a signal (or leader) sequence 
at the N-terminal end of the protein, which co-translationally or post-translationally 
directs transfer of the protein. The polypeptide may also be conjugated to a linker or 
other sequence for ease of synthesis, purification or identification of the polypeptide 
(e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For 
example, a polypeptide may be conjugated to an immunoglobulin Fc region. 

When comparing polypeptide sequences, two sequences are said to be
"identical" if the sequence of amino acids in the two sequences is the same when
aligned for maximum correspondence, as described below. Comparisons between two
sequences are typically performed by comparing the sequences over a comparison
window to identify and compare local regions of sequence similarity. A "comparison
window" as used herein, refers to a segment of at least about 20 contiguous positions,
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a
reference sequence of the same number of contiguous positions after the two sequences
are optimally aligned.

Optimal alignment of sequences for comparison may be conducted using
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR,
Inc., Madison, WI), using default parameters. This program embodies several
alignment schemes described in the following references: Dayhoff, M.O. (1978) A
model of evolutionary change in proteins - Matrices for detecting distant relationships.
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990)
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989)
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson,
E.D. (1971) Comb. Theor 11:105; Saitou, N. and Nei, M. (1987) Mol. Biol. Evol.
4:406-425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and
Lipman, D.J. (1983) Proc. Natl. Acad. Sci. USA 80:726-730.

Alternatively, optimal alignment of sequences for comparison may be
conducted by the local identity algorithm of Smith and Waterman (1981) Adv. Appl.
Math. 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J.
Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988)
Proc. Natl. Acad. Sci. USA 85:2444, by computerized implementations of these
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI),
or by inspection.
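By way of illustration, a global alignment in the manner of Needleman and Wunsch can be sketched with a small dynamic-programming routine. The scoring scheme used here (match +1, mismatch -1, linear gap penalty -1) is an assumption chosen for brevity; the programs named above use substitution matrices and more elaborate gap penalties.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align sequences a and b.

    Returns (score, aligned_a, aligned_b), where '-' marks a gap.
    """
    n, m = len(a), len(b)
    # score[i][j] is the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + pair,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        pair = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + pair:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))
```

The aligned strings returned by such a routine are of equal length and can be compared position by position.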

One preferred example of algorithms that are suitable for determining
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0
algorithms, which are described in Altschul et al. (1997) Nucl. Acids Res. 25:3389-3402
and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST
2.0 can be used, for example with the parameters described herein, to determine percent
sequence identity for the polynucleotides and polypeptides of the invention. Software
for performing BLAST analyses is publicly available through the National Center for
Biotechnology Information. For amino acid sequences, a scoring matrix can be used to
calculate the cumulative score. Extension of the word hits in each direction is halted
when: the cumulative alignment score falls off by the quantity X from its maximum
achieved value; the cumulative score goes to zero or below, due to the accumulation of
one or more negative-scoring residue alignments; or the end of either sequence is
reached. The BLAST algorithm parameters W, T and X determine the sensitivity and
speed of the alignment.
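The three halting conditions for extension of a word hit can be illustrated with a simplified, ungapped extension in one direction. This sketch is not the BLAST implementation: it uses a plain match/mismatch score in place of a scoring matrix, and the function name, parameter names and default values are assumptions made for illustration.

```python
def extend_right(query, subject, q_pos, s_pos, seed_score,
                 x_drop=3, match=1, mismatch=-1):
    """Extend a word hit to the right without gaps.

    q_pos, s_pos : 0-based positions of the first residues after the hit
    seed_score   : cumulative score of the word hit itself
    Returns (best_score, best_length), where best_length is the number of
    residues added to the hit at the point of maximum score.
    """
    score = best = seed_score
    length = best_length = 0
    # Condition 3: stop when the end of either sequence is reached.
    while q_pos + length < len(query) and s_pos + length < len(subject):
        same = query[q_pos + length] == subject[s_pos + length]
        score += match if same else mismatch
        length += 1
        if score > best:
            best, best_length = score, length
        # Condition 1: score fell off by X from its maximum achieved value.
        # Condition 2: cumulative score went to zero or below.
        if best - score >= x_drop or score <= 0:
            break
    return best, best_length
```

A larger value of x_drop lets the extension tolerate longer runs of mismatches before halting, at the cost of more computation.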

In one preferred approach, the "percentage of sequence identity" is
determined by comparing two optimally aligned sequences over a window of
comparison of at least 20 positions, wherein the portion of the polypeptide sequence in
the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent
or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference
sequence (which does not comprise additions or deletions) for optimal alignment of the
two sequences. The percentage is calculated by determining the number of positions at
which the identical amino acid residue occurs in both sequences to yield the number of
matched positions, dividing the number of matched positions by the total number of
positions in the reference sequence (i.e., the window size) and multiplying the result by
100 to yield the percentage of sequence identity.
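The calculation described above can be written out as a short sketch. It assumes that the two sequences have already been optimally aligned over the comparison window, that gaps in the compared sequence are written as "-", and that the reference contains no gaps; the function and parameter names are hypothetical.

```python
def percent_identity(aligned_query, reference,
                     min_window=20, max_gap_fraction=0.20):
    """Percentage of sequence identity over one comparison window.

    aligned_query : compared sequence, '-' marking gaps
    reference     : reference sequence for the same window, without gaps
    """
    if len(aligned_query) != len(reference):
        raise ValueError("sequences must cover the same window")
    window = len(reference)
    if window < min_window:
        raise ValueError("comparison window is too short")
    if "-" in reference:
        raise ValueError("the reference must not contain gaps")
    if aligned_query.count("-") / window > max_gap_fraction:
        raise ValueError("too many gaps in the comparison window")
    # Matched positions: the identical residue occurs in both sequences.
    matches = sum(1 for q, r in zip(aligned_query, reference) if q == r)
    # Divide by the window size and multiply by 100.
    return 100.0 * matches / window
```

For a 20-residue window with one substitution and one gap, 18 of 20 positions match and the function returns 90.0.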

Within other illustrative embodiments, a polypeptide may be a fusion
polypeptide that comprises multiple polypeptides as described herein, or that comprises
at least one polypeptide as described herein and an unrelated sequence, such as a known
tumor protein. A fusion partner may, for example, assist in providing T helper epitopes
(an immunological fusion partner), preferably T helper epitopes recognized by humans,
or may assist in expressing the protein (an expression enhancer) at higher yields than the
native recombinant protein. Certain preferred fusion partners are both immunological
and expression enhancing fusion partners. Other fusion partners may be selected so as
to increase the solubility of the polypeptide or to enable the polypeptide to be targeted to
desired intracellular compartments. Still further fusion partners include affinity tags,
which facilitate purification of the polypeptide.

Fusion polypeptides may generally be prepared using standard
techniques, including chemical conjugation. Preferably, a fusion polypeptide is
expressed as a recombinant polypeptide, allowing the production of increased levels,
relative to a non-fused polypeptide, in an expression system. Briefly, DNA sequences
encoding the polypeptide components may be assembled separately, and ligated into an
appropriate expression vector. The 3' end of the DNA sequence encoding one
polypeptide component is ligated, with or without a peptide linker, to the 5' end of a
DNA sequence encoding the second polypeptide component so that the reading frames
of the sequences are in phase. This permits translation into a single fusion polypeptide
that retains the biological activity of both component polypeptides.
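The requirement that the reading frames be in phase can be illustrated at the sequence level. The following sketch joins two coding sequences, with an optional linker, and checks that the result can be read as a single open reading frame. It is an in silico illustration only, with hypothetical names; it does not model ligation sites, vectors or regulatory elements.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_coding_sequences(first, second, linker=""):
    """Join two DNA coding sequences in frame, optionally through a linker.

    A terminal stop codon on `first` is removed so that translation can
    continue into the linker and the second component.
    """
    first, second, linker = (s.upper() for s in (first, second, linker))
    for name, seq in (("first", first), ("second", second), ("linker", linker)):
        if len(seq) % 3:
            raise ValueError(f"{name} length is not a multiple of three")
    if first[-3:] in STOP_CODONS:
        first = first[:-3]
    fused = first + linker + second
    codons = [fused[i:i + 3] for i in range(0, len(fused), 3)]
    # Any stop codon before the final codon would truncate the fusion.
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        raise ValueError("premature stop codon in the fused reading frame")
    return fused
```

For example, fuse_coding_sequences("ATGGCCTAA", "ATGAAATGA", linker="GGCTCC") drops the first stop codon and inserts a Gly-Ser linker (GGC, TCC) between the two components.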

A peptide linker sequence may be employed to separate the first and
second polypeptide components by a distance sufficient to ensure that each polypeptide
folds into its secondary and tertiary structures. Such a peptide linker sequence is
incorporated into the fusion polypeptide using standard techniques well known in the
art. Suitable peptide linker sequences may be chosen based on the following factors:
(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a
secondary structure that could interact with functional epitopes on the first and second
polypeptides; and (3) the lack of hydrophobic or charged residues that might react with
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly,
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala, may also be
used in the linker sequence. Amino acid sequences which may be usefully employed as
linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al.,
Proc. Natl. Acad. Sci. USA 83:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S.
Patent No. 4,751,180. The linker sequence may generally be from 1 to about 50 amino
acids in length. Linker sequences are not required when the first and second
polypeptides have non-essential N-terminal amino acid regions that can be used to
separate the functional domains and prevent steric interference.
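
As a minimal sketch only, factor (3) and the length range above can be screened
computationally. The function name evaluate_linker and the grouping of residues into
"charged" and "hydrophobic" sets are assumptions made for illustration; factors (1) and (2),
which concern conformation, are not assessed by this sketch.

```python
PREFERRED = set("GNS")        # Gly, Asn, Ser
NEAR_NEUTRAL = set("TA")      # Thr, Ala
CHARGED = set("DEKRH")        # illustrative grouping (assumption)
HYDROPHOBIC = set("VILMFWC")  # illustrative grouping (assumption)


def evaluate_linker(linker: str, max_len: int = 50) -> dict:
    """Report simple composition and length checks for a one-letter-code linker."""
    residues = set(linker.upper())
    return {
        "length_ok": 1 <= len(linker) <= max_len,
        "charged": sorted(residues & CHARGED),
        "hydrophobic": sorted(residues & HYDROPHOBIC),
        "only_preferred_or_near_neutral": residues <= (PREFERRED | NEAR_NEUTRAL),
    }


print(evaluate_linker("GGGGSGGGGS"))
print(evaluate_linker("GKVG"))
```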
The ligated DNA sequences are operably linked to suitable
transcriptional or translational regulatory elements. The regulatory elements
responsible for expression of DNA are located only 5' to the DNA sequence encoding
the first polypeptide. Similarly, stop codons required to end translation and
transcription termination signals are only present 3' to the DNA sequence encoding the
second polypeptide.

The fusion polypeptide can comprise a polypeptide as described herein
together with an unrelated immunogenic protein, such as an immunogenic protein
capable of eliciting a recall response. Examples of such proteins include tetanus,
tuberculosis and hepatitis proteins (see, for example, Stoute et al., New Engl. J. Med.
336:86-91, 1997).

In one preferred embodiment, the immunological fusion partner is
derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ra12
fragment. Ra12 compositions and methods for their use in enhancing the expression
and/or immunogenicity of heterologous polynucleotide/polypeptide sequences are
described in U.S. Patent Application 60/158,585, the disclosure of which is
incorporated herein by reference in its entirety. Briefly, Ra12 refers to a polynucleotide
region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid.
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent
and avirulent strains of M. tuberculosis. The nucleotide sequence and amino acid
sequence of MTB32A have been described (for example, U.S. Patent Application
60/158,585; see also, Skeiky et al., Infection and Immun. (1999) 67:3998-4007,
incorporated herein by reference). C-terminal fragments of the MTB32A coding
sequence express at high levels and remain as soluble polypeptides throughout the
purification process. Moreover, Ra12 may enhance the immunogenicity of heterologous
immunogenic polypeptides with which it is fused. One preferred Ra12 fusion
polypeptide comprises a 14 KD C-terminal fragment corresponding to amino acid
residues 192 to 323 of MTB32A. Other preferred Ra12 polynucleotides generally
comprise at least about 15 consecutive nucleotides, at least about 30 nucleotides, at least
about 60 nucleotides, at least about 100 nucleotides, at least about 200 nucleotides, or at
least about 300 nucleotides that encode a portion of a Ra12 polypeptide. Ra12
polynucleotides may comprise a native sequence (i.e., an endogenous sequence that
encodes a Ra12 polypeptide or a portion thereof) or may comprise a variant of such a
sequence. Ra12 polynucleotide variants may contain one or more substitutions,
additions, deletions and/or insertions such that the biological activity of the encoded
fusion polypeptide is not substantially diminished, relative to a fusion polypeptide
comprising a native Ra12 polypeptide. Variants preferably exhibit at least about 70%
identity, more preferably at least about 80% identity and most preferably at least about
90% identity to a polynucleotide sequence that encodes a native Ra12 polypeptide or a
portion thereof.

Within other preferred embodiments, an immunological fusion partner is
derived from protein D, a surface protein of the gram-negative bacterium Haemophilus
influenza B (WO 91/18926). Preferably, a protein D derivative comprises
approximately the first third of the protein (e.g., the first N-terminal 100-110 amino
acids), and a protein D derivative may be lipidated. Within certain preferred
embodiments, the first 109 residues of a Lipoprotein D fusion partner are included on the
N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to
increase the expression level in E. coli (thus functioning as an expression enhancer).
The lipid tail ensures optimal presentation of the antigen to antigen presenting cells.
Other fusion partners include the non-structural protein from influenza virus, NS1
(hemagglutinin). Typically, the N-terminal 81 amino acids are used, although different
fragments that include T-helper epitopes may be used.

In another embodiment, the immunological fusion partner is the protein
known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is
derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine
amidase known as amidase LYTA (encoded by the LytA gene; Gene 45:265-292, 1986).
LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan
backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to
the choline or to some choline analogues such as DEAE. This property has been
exploited for the development of E. coli C-LYTA expressing plasmids useful for
expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA
fragment at the amino terminus has been described (see Biotechnology 10:795-798,
1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated
into a fusion polypeptide. A repeat portion is found in the C-terminal region starting at
residue 178. A particularly preferred repeat portion incorporates residues 188-305.

Yet another illustrative embodiment involves fusion polypeptides, and
the polynucleotides encoding them, wherein the fusion partner comprises a targeting
signal capable of directing a polypeptide to the endosomal/lysosomal compartment, as
described in U.S. Patent No. 5,633,234. An immunogenic polypeptide of the invention,
when fused with this targeting signal, will associate more efficiently with MHC class II
molecules and thereby provide enhanced in vivo stimulation of CD4+ T-cells specific
for the polypeptide.

Polypeptides of the invention are prepared using any of a variety of well
known synthetic and/or recombinant techniques, the latter of which are further
described below. Polypeptides, portions and other variants generally less than about
150 amino acids can be generated by synthetic means, using techniques well known to
those of ordinary skill in the art. In one illustrative example, such polypeptides are
synthesized using any of the commercially available solid-phase techniques, such as the
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a
growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 85:2149-2154, 1963.
Equipment for automated synthesis of polypeptides is commercially available from
suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and
may be operated according to the manufacturer's instructions.

In general, polypeptide compositions (including fusion polypeptides) of
the invention are isolated. An "isolated" polypeptide is one that is removed from its
original environment. For example, a naturally-occurring protein or polypeptide is
isolated if it is separated from some or all of the coexisting materials in the natural
system. Preferably, such polypeptides are also purified, e.g., are at least about 90%
pure, more preferably at least about 95% pure and most preferably at least about 99%
pure.

Polynucleotide Compositions

The present invention, in other aspects, provides polynucleotide
compositions. The terms "DNA" and "polynucleotide" are used essentially
interchangeably herein to refer to a DNA molecule that has been isolated free of total
genomic DNA of a particular species. "Isolated," as used herein, means that a
polynucleotide is substantially away from other coding sequences, and that the DNA
molecule does not contain large portions of unrelated coding DNA, such as large
chromosomal fragments or other functional genes or polypeptide coding regions. Of
course, this refers to the DNA molecule as originally isolated, and does not exclude
genes or coding regions later added to the segment by the hand of man.

As will be understood by those skilled in the art, the polynucleotide
compositions of this invention can include genomic sequences, extra-genomic and
plasmid-encoded sequences and smaller engineered gene segments that express, or may
be adapted to express, proteins, polypeptides, peptides and the like. Such segments may
be naturally isolated, or modified synthetically by the hand of man.

As will be also recognized by the skilled artisan, polynucleotides of the
invention may be single-stranded (coding or antisense) or double-stranded, and may be
DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules may include
HnRNA molecules, which contain introns and correspond to a DNA molecule in a
one-to-one manner, and mRNA molecules, which do not contain introns. Additional coding
or non-coding sequences may, but need not, be present within a polynucleotide of the
present invention, and a polynucleotide may, but need not, be linked to other molecules
and/or support materials.

Polynucleotides may comprise a native sequence (i.e., an endogenous 
sequence that encodes a polypeptide/protein of the invention or a portion thereof) or 
may comprise a sequence that encodes a variant or derivative, preferably an
immunogenic variant or derivative, of such a sequence.

Therefore, according to another aspect of the present invention,
polynucleotide compositions are provided that comprise some or all of a polynucleotide
sequence set forth in any one of SEQ ID NO: 1-61, 63-175, 178, 180, 182-468, 474,
476, 477, 479, 482, 484, 486, 489-492, 504-506, 510-513, 520-533, 548-550, 564,
566-569, and 576, complements of a polynucleotide sequence set forth in any one of SEQ ID
NO: 1-61, 63-175, 178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486, 489-492,
504-506, 510-513, 520-533, 548-550, 564, 566-569, and 576, and degenerate variants of a
polynucleotide sequence set forth in any one of SEQ ID NO: 1-61, 63-175, 178, 180,
182-468, 474, 476, 477, 479, 482, 484, 486, 489-492, 504-506, 510-513, 520-533,
548-550, 564, 566-569, and 576. In certain preferred embodiments, the polynucleotide
sequences set forth herein encode immunogenic polypeptides, as described above.
In other related embodiments, the present invention provides
polynucleotide variants having substantial identity to the sequences disclosed herein in
SEQ ID NO: 1-61, 63-175, 178, 180, 182-468, 474, 476, 477, 479, 482, 484, 486,
489-492, 504-506, 510-513, 520-533, 548-550, 564, 566-569, and 576, for example those
comprising at least 70% sequence identity, preferably at least 75%, 80%, 85%, 90%,
95%, 96%, 97%, 98%, or 99% or higher, sequence identity compared to a
polynucleotide sequence of this invention using the methods described herein (e.g.,
BLAST analysis using standard parameters, as described below). One skilled in this art
will recognize that these values can be appropriately adjusted to determine
corresponding identity of proteins encoded by two nucleotide sequences by taking into
account codon degeneracy, amino acid similarity, reading frame positioning and the
like.

Typically, polynucleotide variants will contain one or more substitutions,
additions, deletions and/or insertions, preferably such that the immunogenicity of the
polypeptide encoded by the variant polynucleotide is not substantially diminished
relative to a polypeptide encoded by a polynucleotide sequence specifically set forth
herein. The term "variants" should also be understood to encompass homologous
genes of xenogenic origin.

In additional embodiments, the present invention provides
polynucleotide fragments comprising various lengths of contiguous stretches of
sequence identical to or complementary to one or more of the sequences disclosed
herein. For example, polynucleotides are provided by this invention that comprise at
least about 10, 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more
contiguous nucleotides of one or more of the sequences disclosed herein as well as all
intermediate lengths there between. It will be readily understood that "intermediate
lengths", in this context, means any length between the quoted values, such as 16, 17,
18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103,
etc.; 150, 151, 152, 153, etc.; including all integers through 200-500; 500-1,000, and the
like.

In another embodiment of the invention, polynucleotide compositions are
provided that are capable of hybridizing under moderate to high stringency conditions to
a polynucleotide sequence provided herein, or a fragment thereof, or a complementary
sequence thereof. Hybridization techniques are well known in the art of molecular
biology. For purposes of illustration, suitable moderately stringent conditions for
testing the hybridization of a polynucleotide of this invention with other polynucleotides
include prewashing in a solution of 5 X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0);
hybridizing at 50°C-60°C, 5 X SSC, overnight; followed by washing twice at 65°C for
20 minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% SDS. One skilled in
the art will understand that the stringency of hybridization can be readily manipulated,
such as by altering the salt content of the hybridization solution and/or the temperature
at which the hybridization is performed. For example, in another embodiment, suitable
highly stringent hybridization conditions include those described above, with the
exception that the temperature of hybridization is increased, e.g., to 60-65°C or
65-70°C.

In certain preferred embodiments, the polynucleotides described above,
e.g., polynucleotide variants, fragments and hybridizing sequences, encode polypeptides
that are immunologically cross-reactive with a polypeptide sequence specifically set
forth herein. In other preferred embodiments, such polynucleotides encode
polypeptides that have a level of immunogenic activity of at least about 50%, preferably
at least about 70%, and more preferably at least about 90% of that for a polypeptide
sequence specifically set forth herein.

The polynucleotides of the present invention, or fragments thereof,
regardless of the length of the coding sequence itself, may be combined with other DNA
sequences, such as promoters, polyadenylation signals, additional restriction enzyme
sites, multiple cloning sites, other coding segments, and the like, such that their overall
length may vary considerably. It is therefore contemplated that a nucleic acid fragment
of almost any length may be employed, with the total length preferably being limited by
the ease of preparation and use in the intended recombinant DNA protocol. For
example, illustrative polynucleotide segments with total lengths of about 10,000, about
5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50
base pairs in length, and the like, (including all intermediate lengths) are contemplated
to be useful in many implementations of this invention.
When comparing polynucleotide sequences, two sequences are said to be
"identical" if the sequence of nucleotides in the two sequences is the same when aligned
for maximum correspondence, as described below. Comparisons between two
sequences are typically performed by comparing the sequences over a comparison
window to identify and compare local regions of sequence similarity. A "comparison
window" as used herein, refers to a segment of at least about 20 contiguous positions,
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a
reference sequence of the same number of contiguous positions after the two sequences
are optimally aligned.

Optimal alignment of sequences for comparison may be conducted using
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR,
Inc., Madison, WI), using default parameters. This program embodies several
alignment schemes described in the following references: Dayhoff, M.O. (1978) A
model of evolutionary change in proteins - Matrices for detecting distant relationships.
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990)
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989)
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson,
E.D. (1971) Comb. Theor. 11:105; Saitou, N. and Nei, M. (1987) Mol. Biol. Evol.
4:406-425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and
Lipman, D.J. (1983) Proc. Natl. Acad. Sci. USA 80:726-730.

Alternatively, optimal alignment of sequences for comparison may be
conducted by the local identity algorithm of Smith and Waterman (1981) Adv. Appl.
Math. 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J.
Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988)
Proc. Natl. Acad. Sci. USA 85:2444, by computerized implementations of these
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI),
or by inspection.
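
For purposes of illustration only, a global alignment score in the style of Needleman and
Wunsch may be computed by dynamic programming as sketched below. The scoring values (match
+1, mismatch -1, linear gap penalty -1) and the function name needleman_wunsch_score are
assumptions chosen for the example and are not the parameters of any of the named programs.

```python
def needleman_wunsch_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -1) -> int:
    """Return the optimal global alignment score of sequences a and b."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    # Aligning a prefix against an empty sequence costs one gap per residue.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            diagonal = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Best of: align the two residues, gap in b, gap in a.
            score[i][j] = max(diagonal, score[i - 1][j] + gap, score[i][j - 1] + gap)
    return score[-1][-1]


print(needleman_wunsch_score("GATTACA", "GATACA"))
```

A local alignment in the style of Smith and Waterman uses the same recurrence but does not
allow a cell to fall below zero and reports the highest-scoring cell rather than the last one.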

One preferred example of algorithms that are suitable for determining
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0
algorithms, which are described in Altschul et al. (1997) Nucl. Acids Res. 25:3389-3402
and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST
2.0 can be used, for example with the parameters described herein, to determine percent
sequence identity for the polynucleotides of the invention. Software for performing
BLAST analyses is publicly available through the National Center for Biotechnology
Information. In one illustrative example, cumulative scores can be calculated using, for
nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always >0) and N (penalty score for mismatching residues; always <0). Extension of
the word hits in each direction is halted when: the cumulative alignment score falls off
by the quantity X from its maximum achieved value; the cumulative score goes to zero
or below, due to the accumulation of one or more negative-scoring residue alignments;
or the end of either sequence is reached. The BLAST algorithm parameters W, T and X
determine the sensitivity and speed of the alignment. The BLASTN program (for
nucleotide sequences) uses as defaults a wordlength (W) of 11, and expectation (E) of
10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl.
Acad. Sci. USA 89:10915) alignments (B) of 50, expectation (E) of 10, M=5, N=-4 and
a comparison of both strands.
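
The three halting conditions described above can be sketched, in one direction only and
without gaps, as follows. This is a simplified illustration under stated assumptions and is
not the BLAST implementation: the function name extend_right, the treatment of the seed as
an exact match of `word` residues, and the use of "greater than or equal to X" for the
drop-off test are all choices made for the example. M and N take the values quoted above.

```python
def extend_right(query: str, subject: str, q_start: int, s_start: int,
                 word: int, M: int = 5, N: int = -4, X: int = 20):
    """Extend an exact seed to the right; return (best score, length at best score)."""
    score = best = word * M          # the seed is assumed to match exactly
    best_len = word
    i = word
    # Condition 3: stop when the end of either sequence is reached.
    while q_start + i < len(query) and s_start + i < len(subject):
        score += M if query[q_start + i] == subject[s_start + i] else N
        i += 1
        if score > best:
            best, best_len = score, i
        # Condition 1: score has fallen off by X from its maximum.
        # Condition 2: cumulative score has gone to zero or below.
        if best - score >= X or score <= 0:
            break
    return best, best_len


print(extend_right("ACGTACGTTTTT", "ACGTACGTAAAA", 0, 0, word=4, X=8))
```

In the usage line, the extension is cut back to the eight matching residues because the two
mismatches that follow lower the running score by X = 8 from its maximum.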

Preferably, the "percentage of sequence identity" is determined by
comparing two optimally aligned sequences over a window of comparison of at least 20
positions, wherein the portion of the polynucleotide sequence in the comparison
window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5
to 15 percent, or 10 to 12 percent, as compared to the reference sequence (which does
not comprise additions or deletions) for optimal alignment of the two sequences. The
percentage is calculated by determining the number of positions at which the identical
nucleic acid base occurs in both sequences to yield the number of matched positions,
dividing the number of matched positions by the total number of positions in the
reference sequence (i.e., the window size) and multiplying the results by 100 to yield the
percentage of sequence identity.
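
The calculation just described may be sketched as follows, assuming that the two sequences
have already been optimally aligned and are supplied as equal-length strings in which "-"
denotes a gap. The function name percent_identity and the gap symbol are assumptions made
for illustration.

```python
def percent_identity(aligned_query: str, aligned_reference: str) -> float:
    """Matched positions divided by reference positions in the window, times 100."""
    if len(aligned_query) != len(aligned_reference):
        raise ValueError("aligned sequences must have the same length")
    # Window size: positions of the reference sequence (gap columns excluded).
    window = sum(1 for r in aligned_reference if r != "-")
    if window == 0:
        raise ValueError("reference contains no positions")
    matched = sum(1 for q, r in zip(aligned_query, aligned_reference)
                  if q == r and q != "-")
    return 100.0 * matched / window


reference = "ACGTACGTACGTACGTACGT"
query = "ACGTACGTAC-TACGTACGT"   # one deletion relative to the reference
print(percent_identity(query, reference))
```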

It will be appreciated by those of ordinary skill in the art that, as a result
of the degeneracy of the genetic code, there are many nucleotide sequences that encode
a polypeptide as described herein. Some of these polynucleotides bear minimal
homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides
that vary due to differences in codon usage are specifically contemplated by the present
invention. Further, alleles of the genes comprising the polynucleotide sequences
provided herein are within the scope of the present invention. Alleles are endogenous
genes that are altered as a result of one or more mutations, such as deletions, additions
and/or substitutions of nucleotides. The resulting mRNA and protein may, but need not,
have an altered structure or function. Alleles may be identified using standard
techniques (such as hybridization, amplification and/or database sequence comparison).
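
To illustrate the scale of codon degeneracy, the number of distinct nucleotide sequences
that encode a given peptide under the standard genetic code is the product of the number of
synonymous codons for each residue. The sketch below assumes the standard genetic code and
one-letter amino acid codes; the function name count_encoding_sequences and the example
peptides are hypothetical.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "L": 6, "S": 6, "R": 6,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "I": 3,
    "C": 2, "D": 2, "E": 2, "F": 2, "H": 2, "K": 2, "N": 2, "Q": 2, "Y": 2,
    "M": 1, "W": 1,
}


def count_encoding_sequences(peptide: str) -> int:
    """Count the coding sequences (stop codon excluded) that encode the peptide."""
    return prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())


print(count_encoding_sequences("GAS"))   # Gly-Ala-Ser
```

Even a three-residue peptide such as Gly-Ala-Ser has 4 x 4 x 6 = 96 possible coding
sequences, and the count grows multiplicatively with peptide length.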

Therefore, in another embodiment of the invention, a mutagenesis
approach, such as site-specific mutagenesis, is employed for the preparation of
immunogenic variants and/or derivatives of the polypeptides described herein. By this
approach, specific modifications in a polypeptide sequence can be made through
mutagenesis of the underlying polynucleotides that encode them. These techniques
provide a straightforward approach to prepare and test sequence variants, for example,
incorporating one or more of the foregoing considerations, by introducing one or more
nucleotide sequence changes into the polynucleotide.

Site-specific mutagenesis allows the production of mutants through the
use of specific oligonucleotide sequences which encode the DNA sequence of the
desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a
primer sequence of sufficient size and sequence complexity to form a stable duplex on
both sides of the deletion junction being traversed. Mutations may be employed in a
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise
change the properties of the polynucleotide itself, and/or alter the properties, activity,
composition, stability, or primary sequence of the encoded polypeptide.

In certain embodiments of the present invention, the inventors
contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or
more properties of the encoded polypeptide, such as the immunogenicity of a
polypeptide vaccine. The techniques of site-specific mutagenesis are well-known in the
art, and are widely used to create variants of both polypeptides and polynucleotides. For
example, site-specific mutagenesis is often used to alter a specific portion of a DNA
molecule. In such embodiments, a primer comprising typically about 14 to about 25
nucleotides or so in length is employed, with about 5 to about 10 residues on both sides
of the junction of the sequence being altered.
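
A minimal sketch of primer selection for a single-base substitution, with a fixed number of
unaltered residues on each side of the altered position, is given below. The function name
design_mutagenic_primer, the 0-based position convention, and the restriction to a single
substitution are assumptions made for illustration; the sketch returns the sequence in the
same orientation as the template string supplied and does not address which strand the
primer is annealed to.

```python
def design_mutagenic_primer(template: str, position: int,
                            new_base: str, flank: int = 10) -> str:
    """Return a primer carrying new_base at `position` with `flank` residues per side."""
    if len(new_base) != 1:
        raise ValueError("this sketch handles single-base substitutions only")
    if not 5 <= flank <= 10:
        raise ValueError("flank should be about 5 to about 10 residues")
    if position - flank < 0 or position + flank >= len(template):
        raise ValueError("not enough template sequence on one side of the mutation")
    mutated = template[:position] + new_base + template[position + 1:]
    # The primer spans the altered base plus `flank` unaltered residues on each side.
    return mutated[position - flank: position + flank + 1]


template = "ACGT" * 8
primer = design_mutagenic_primer(template, position=16, new_base="G", flank=10)
print(primer, len(primer))
```

With this layout a flank of 7 to 10 residues yields primers of 15 to 21 nucleotides, which
falls within the range of about 14 to about 25 nucleotides noted above.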
As will be appreciated by those of skill in the art, site-specific
mutagenesis techniques have often employed a phage vector that exists in both a single
stranded and double stranded form. Typical vectors useful in site-directed mutagenesis
include vectors such as the M13 phage. These phage are readily commercially-available
and their use is generally well-known to those skilled in the art. Double-stranded
plasmids are also routinely employed in site directed mutagenesis, which eliminates the
step of transferring the gene of interest from a plasmid to a phage.

In general, site-directed mutagenesis in accordance herewith is
performed by first obtaining a single-stranded vector or melting apart the two strands of a
double-stranded vector that includes within its sequence a DNA sequence that encodes
the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is
prepared, generally synthetically. This primer is then annealed with the single-stranded
vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I
Klenow fragment, in order to complete the synthesis of the mutation-bearing strand.
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated
sequence and the second strand bears the desired mutation. This heteroduplex vector is
then used to transform appropriate cells, such as E. coli cells, and clones are selected
which include recombinant vectors bearing the mutated sequence arrangement.

The preparation of sequence variants of the selected peptide-encoding
DNA segments using site-directed mutagenesis provides a means of producing
potentially useful species and is not meant to be limiting as there are other ways in
which sequence variants of peptides and the DNA sequences encoding them may be
obtained. For example, recombinant vectors encoding the desired peptide sequence
may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence
variants. Specific details regarding these methods and protocols are found in the
teachings of Maloy et al., 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and
Maniatis et al., 1982, each incorporated herein by reference, for that purpose.

As used herein, the term "oligonucleotide directed mutagenesis
procedure" refers to template-dependent processes and vector-mediated propagation
which result in an increase in the concentration of a specific nucleic acid molecule
relative to its initial concentration, or in an increase in the concentration of a detectable
signal, such as amplification. As used herein, the term "oligonucleotide directed
mutagenesis procedure" is intended to refer to a process that involves the
template-dependent extension of a primer molecule. The term template dependent
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the
sequence of the newly synthesized strand of nucleic acid is dictated by the well-known
rules of complementary base pairing (see, for example, Watson, 1987). Typically,
vector mediated methodologies involve the introduction of the nucleic acid fragment
into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of
the amplified nucleic acid fragment. Examples of such methodologies are provided by
U.S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety.
In another approach for the production of polypeptide variants of the
present invention, recursive sequence recombination, as described in U.S. Patent No.
5,837,458, may be employed. In this approach, iterative cycles of recombination and
screening or selection are performed to "evolve" individual polynucleotide variants of
the invention having, for example, enhanced immunogenic activity.
In other embodiments of the present invention, the polynucleotide
sequences provided herein can be advantageously used as probes or primers for nucleic
acid hybridization. As such, it is contemplated that nucleic acid segments that comprise
a sequence region of at least about 15 nucleotide long contiguous sequence that has the
same sequence as, or is complementary to, a 15 nucleotide long contiguous sequence
disclosed herein will find particular utility. Longer contiguous identical or
complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000
(including all intermediate lengths) and even up to full length sequences will also be of
use in certain embodiments.
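
Whether a candidate segment contains a contiguous stretch that is identical to, or
complementary to, a stretch of a reference sequence can be checked as sketched below. The
function names (reverse_complement, shares_contiguous_stretch) and the example sequence are
hypothetical and do not correspond to any SEQ ID NO of the disclosure; complementarity is
taken here to mean the reverse complement, and only the bases A, C, G and T are handled.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def shares_contiguous_stretch(candidate: str, reference: str, k: int = 15) -> bool:
    """True if candidate has a k-long stretch identical or complementary to reference."""
    ref = reference.upper()
    rc = reverse_complement(ref)
    # Collect every k-long window of the reference on both strands.
    windows = {s[i:i + k] for s in (ref, rc) for i in range(len(s) - k + 1)}
    cand = candidate.upper()
    return any(cand[i:i + k] in windows for i in range(len(cand) - k + 1))


reference = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"   # arbitrary example sequence
probe = reverse_complement(reference[5:25])
print(shares_contiguous_stretch(probe, reference))
```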

The ability of such nucleic acid probes to specifically hybridize to a
sequence of interest will enable them to be of use in detecting the presence of
complementary sequences in a given sample. However, other uses are also envisioned,
such as the use of the sequence information for the preparation of mutant species
primers, or primers for use in preparing other genetic constructions.

Polynucleotide molecules having sequence regions consisting of
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides
or so (including intermediate lengths as well), identical or complementary to a
polynucleotide sequence disclosed herein, are particularly contemplated as hybridization
probes for use in, e.g., Southern and Northern blotting. This would allow a gene
product, or fragment thereof, to be analyzed, both in diverse cell types and also in
various bacterial cells. The total size of fragment, as well as the size of the
complementary stretch(es), will ultimately depend on the intended use or application of
the particular nucleic acid segment. Smaller fragments will generally find use in
hybridization embodiments, wherein the length of the contiguous complementary region
may be varied, such as between about 15 and about 100 nucleotides, but larger
contiguous complementarity stretches may be used, according to the length of the
complementary sequences one wishes to detect.

The use of a hybridization probe of about 15-25 nucleotides in length
allows the formation of a duplex molecule that is both stable and selective. Molecules
having contiguous complementary sequences over stretches greater than 15 bases in
length are generally preferred, though, in order to increase stability and selectivity of the
hybrid, and thereby improve the quality and degree of specific hybrid molecules
obtained. One will generally prefer to design nucleic acid molecules having
gene-complementary stretches of 15 to 25 contiguous nucleotides, or even longer where
desired.

Hybridization probes may be selected from any portion of any of the
sequences disclosed herein. All that is required is to review the sequences set forth
herein, or to any contiguous portion of the sequences, from about 15-25 nucleotides in
length up to and including the full length sequence, that one wishes to utilize as a probe
or primer. The choice of probe and primer sequences may be governed by various
factors. For example, one may wish to employ primers from towards the termini of the
total sequence.

Small polynucleotide segments or fragments may be readily prepared by,
for example, directly synthesizing the fragment by chemical means, as is commonly
practiced using an automated oligonucleotide synthesizer. Also, fragments may be
obtained by application of nucleic acid reproduction technology, such as the PCR™
technology of U.S. Patent 4,683,202 (incorporated herein by reference), by introducing
selected sequences into recombinant vectors for recombinant production, and by other
recombinant DNA techniques generally known to those of skill in the art of molecular
biology.

The nucleotide sequences of the invention may be used for their ability to
selectively form duplex molecules with complementary stretches of the entire gene or
gene fragments of interest. Depending on the application envisioned, one will typically
desire to employ varying conditions of hybridization to achieve varying degrees of
selectivity of probe towards target sequence. For applications requiring high selectivity,
one will typically desire to employ relatively stringent conditions to form the hybrids,
e.g., one will select relatively low salt and/or high temperature conditions, such as
provided by a salt concentration of from about 0.02 M to about 0.15 M salt at
temperatures of from about 50°C to about 70°C. Such selective conditions tolerate
little, if any, mismatch between the probe and the template or target strand, and would
be particularly suitable for isolating related sequences.

Of course, for some applications, for example, where one desires to 
prepare mutants employing a mutant primer strand hybridized to an underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 
needed in order to allow formation of the heteroduplex. In these circumstances, one 
may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M 
salt, at temperatures ranging from about 20°C to about 55°C. Cross-hybridizing species 
can thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 
more stringent by the addition of increasing amounts of formamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. Thus, 
hybridization conditions can be readily manipulated, and the conditions employed will 
generally be chosen depending on the desired results. 
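
The salt and temperature ranges recited above can be captured in a small lookup, shown here as a minimal Python sketch. The function names are hypothetical. The formamide adjustment uses an assumed rule of thumb of roughly 0.6°C of equivalent temperature per percent formamide; that figure is not taken from this specification and is exposed as a parameter so that it can be changed.

```python
# Ranges as recited in the text: (low, high) bounds, inclusive.
HIGH_STRINGENCY = {"salt_M": (0.02, 0.15), "temp_C": (50.0, 70.0)}
REDUCED_STRINGENCY = {"salt_M": (0.15, 0.9), "temp_C": (20.0, 55.0)}


def _within(value, bounds):
    low, high = bounds
    return low <= value <= high


def classify_conditions(salt_molar, temp_c):
    """Return 'high', 'reduced' or 'outside' for a salt/temperature pair."""
    for label, ranges in (("high", HIGH_STRINGENCY), ("reduced", REDUCED_STRINGENCY)):
        if _within(salt_molar, ranges["salt_M"]) and _within(temp_c, ranges["temp_C"]):
            return label
    return "outside"


def formamide_equivalent_temperature(temp_c, formamide_percent, degrees_per_percent=0.6):
    """Temperature that would be about as stringent without formamide.

    degrees_per_percent is an assumed rule-of-thumb value, not a figure from the text.
    """
    return temp_c + degrees_per_percent * formamide_percent
```

Because the two recited ranges touch at 0.15 M salt and overlap between 50°C and 55°C, the sketch checks the high-stringency range first; that ordering is a design choice of the example only.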
According to another embodiment of the present invention, 
polynucleotide compositions comprising antisense oligonucleotides are provided. 
Antisense oligonucleotides have been demonstrated to be effective and targeted 
inhibitors of protein synthesis, and, consequently, provide a therapeutic approach by 
which a disease can be treated by inhibiting the synthesis of proteins that contribute to 
the disease. The efficacy of antisense oligonucleotides for inhibiting protein synthesis 
is well established. For example, the synthesis of polygalacturonase and the muscarine 
type 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to their 
respective mRNA sequences (U. S. Patent 5,739,119 and U. S. Patent 5,759,829). 
Further, examples of antisense inhibition have been demonstrated with the nuclear 
protein cyclin, the multiple drug resistance gene (MDG1), ICAM-1, E-selectin, STK-1, 
striatal GABAa receptor and human EGF (Jaskulski et al., Science. 1988 Jun 
10;240(4858):1544-6; Vasanthakumar and Ahmed, Cancer Commun. 1989;1(4):225- 
32; Peris et al., Brain Res Mol Brain Res. 1998 Jun 15;57(2):310-20; U. S. Patent 
5,801,154; U. S. Patent 5,789,573; U. S. Patent 5,718,709 and U. S. Patent 5,610,288). 
Antisense constructs have also been described that inhibit and can be used to treat a 
variety of abnormal cellular proliferations, e.g. cancer (U. S. Patent 5,747,470; U. S. 
Patent 5,591,317 and U. S. Patent 5,783,583). 

Therefore, in certain embodiments, the present invention provides 
oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
capable of specifically binding to a polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA 
or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or 
derivatives thereof. In a third embodiment, the oligonucleotides are modified DNAs 
comprising a phosphorothioated modified backbone. In a fourth embodiment, the 
oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 
each case, preferred compositions comprise a sequence region that is complementary, 
and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more portions of polynucleotides disclosed herein. 
Selection of antisense compositions specific for a given gene sequence is based upon 
analysis of the chosen target sequence and determination of secondary structure, Tm, 
binding energy, and relative stability. Antisense compositions may be selected based 
upon their relative inability to form dimers, hairpins, or other secondary structures that 
would reduce or prohibit specific binding to the target mRNA in a host cell. Highly 
preferred target regions of the mRNA are those which are at or near the AUG 
translation initiation codon, and those sequences which are substantially complementary 
to 5' regions of the mRNA. These secondary structure analyses and target site selection 
considerations can be performed, for example, using v.4 of the OLIGO primer analysis 
software and/or the BLASTN 2.0.5 algorithm software (Altschul et al., Nucleic Acids 
Res. 1997, 25(17):3389-402). 
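
The target-site considerations above can be sketched as follows. This Python fragment is illustrative only and is not a substitute for the OLIGO or BLASTN analyses named in the text. It assumes that the first AUG found in the supplied mRNA is the translation initiation codon, uses hypothetical function names, and applies a deliberately crude self-complementarity screen in place of a real secondary structure or binding energy calculation.

```python
RNA_TO_ANTISENSE_DNA = {"A": "T", "U": "A", "G": "C", "C": "G"}
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def antisense_to_start_codon(mrna, length=18, upstream=6):
    """Return (target_region, antisense_dna) spanning the first AUG.

    Assumption: the first AUG in the string is the initiation codon.
    """
    rna = mrna.upper().replace("T", "U")
    aug = rna.find("AUG")
    if aug < 0:
        raise ValueError("no AUG codon found")
    start = max(0, aug - upstream)
    target = rna[start:start + length]
    # Antisense strand is the reverse complement of the target, written 5' to 3'.
    antisense = "".join(RNA_TO_ANTISENSE_DNA[base] for base in reversed(target))
    return target, antisense


def has_self_complementarity(oligo, stem=4):
    """Crude flag: True if any stem-length window can pair with the oligo itself."""
    oligo = oligo.upper()
    for i in range(len(oligo) - stem + 1):
        window = oligo[i:i + stem]
        partner = "".join(DNA_COMPLEMENT[base] for base in reversed(window))
        if partner in oligo:
            return True
    return False
```

A candidate flagged by `has_self_complementarity` is not necessarily unusable; the flag only marks oligonucleotides that merit the fuller dimer and hairpin analysis described above.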

The use of an antisense delivery method employing a short peptide 
vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a 
hydrophobic domain derived from the fusion sequence of HIV gp41 and a hydrophilic 
domain from the nuclear localization sequence of SV40 T-antigen (Morris et al., 
Nucleic Acids Res. 1997 Jul 15;25(14):2730-6). It has been demonstrated that several 
molecules of the MPG peptide coat the antisense oligonucleotides, which can then be 
delivered into cultured mammalian cells in less than 1 hour with relatively high 
efficiency (90%). Further, the interaction with MPG strongly increases both the stability 
of the oligonucleotide to nuclease and the ability to cross the plasma membrane. 

According to another embodiment of the invention, the polynucleotide 
compositions described herein are used in the design and preparation of ribozyme 
molecules for inhibiting expression of the tumor polypeptides and proteins of the 
present invention in tumor cells. Ribozymes are RNA-protein complexes that cleave 
nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that 
possess endonuclease activity (Kim and Cech, Proc Natl Acad Sci USA. 1987 
Dec;84(24):8788-92; Forster and Symons, Cell. 1987 Apr 24;49(2):211-20). For 
example, a large number of ribozymes accelerate phosphoester transfer reactions with a 
high degree of specificity, often cleaving only one of several phosphoesters in an 
oligonucleotide substrate (Cech et al., Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 
Westhof, J Mol Biol. 1990 Dec 5;216(3):585-610; Reinhold-Hurek and Shub, Nature. 
1992 May 14;357(6374):173-6). This specificity has been attributed to the requirement 
that the substrate bind via specific base-pairing interactions to the internal guide 
sequence ("IGS") of the ribozyme prior to chemical reaction. 

Six basic varieties of naturally-occurring enzymatic RNAs are known 
presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and 
thus can cleave other RNA molecules) under physiological conditions. In general, 
enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs 
through the target binding portion of an enzymatic nucleic acid which is held in close 
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 
Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the correct site, acts enzymatically to 
cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 
cleaved its RNA target, it is released from that RNA to search for another target and can 
repeatedly bind and cleave new targets. 

The enzymatic nature of a ribozyme is advantageous over many 
technologies, such as antisense technology (where a nucleic acid molecule simply binds 
to a nucleic acid target to block its translation) since the concentration of ribozyme 
necessary to effect a therapeutic treatment is lower than that of an antisense 
oligonucleotide. This advantage reflects the ability of the ribozyme to act 
enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of 
target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 
of inhibition depending not only on the base pairing mechanism of binding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 
substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
ribozyme. Similar mismatches in antisense molecules do not prevent their action 
(Woolf et al., Proc Natl Acad Sci USA. 1992 Aug 15;89(16):7305-9). Thus, the 
specificity of action of a ribozyme is greater than that of an antisense oligonucleotide 
binding the same RNA site. 

The enzymatic nucleic acid molecule may be formed in a hammerhead, 
hairpin, a hepatitis δ virus, group I intron or RNaseP RNA (in association with an RNA 
guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are 
described by Rossi et al., Nucleic Acids Res. 1992 Sep 11;20(17):4559-65. Examples of 
hairpin motifs are described by Hampel et al. (Eur. Pat. Appl. Publ. No. EP 0360257), 
Hampel and Tritz, Biochemistry. 1989 Jun 13;28(12):4929-33; Hampel et al., Nucleic 
Acids Res. 1990 Jan 25;18(2):299-304 and U. S. Patent 5,631,359. An example of the 
hepatitis δ virus motif is described by Perrotta and Been, Biochemistry. 1992 Dec 
1;31(47):11843-52; an example of the RNaseP motif is described by Guerrier-Takada 
et al., Cell. 1983 Dec;35(3 Pt 2):849-57; the Neurospora VS RNA ribozyme motif is 
described by Collins (Saville and Collins, Cell. 1990 May 18;61(4):685-96; Saville and 
Collins, Proc Natl Acad Sci USA. 1991 Oct 1;88(19):8826-30; Collins and Olive, 
Biochemistry. 1993 Mar 23;32(11):2795-9); and an example of the Group I intron is 
described in U. S. Patent 4,987,071. All that is important in an enzymatic nucleic acid 
molecule of this invention is that it has a specific substrate binding site which is 
complementary to one or more of the target gene RNA regions, and that it has 
nucleotide sequences within or surrounding that substrate binding site which impart an 
RNA cleaving activity to the molecule. Thus the ribozyme constructs need not be 
limited to specific motifs mentioned herein. 

Ribozymes may be designed as described in Int. Pat. Appl. Publ. No. 
WO 93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595 (each specifically 
incorporated herein by reference) and synthesized to be tested in vitro and in vivo, as 
described. Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in the art will recognize that equivalent RNA targets in 
other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the 
ribozyme binding arms, or chemically synthesizing ribozymes with modifications that 
prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl. Publ. No. WO 
92/07065; Int. Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. Publ. No. WO 
91/03162; Eur. Pat. Appl. Publ. No. 92110298.4; U. S. Patent 5,334,711; and Int. Pat. 
Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can 
be made to the sugar moieties of enzymatic RNA molecules), modifications which 
enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements. 

Sullivan et al. (Int. Pat. Appl. Publ. No. WO 94/02595) describes the 
general methods for delivery of enzymatic RNA molecules. Ribozymes may be 
administered to cells by a variety of methods known to those familiar to the art, 
including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be 
directly delivered ex vivo to cells or tissues with or without the aforementioned vehicles. 
Alternatively, the RNA/vehicle combination may be locally delivered by direct 
inhalation, by direct injection or by use of a catheter, infusion pump or stent. Other 
routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions 
of ribozyme delivery and administration are provided in Int. Pat. Appl. Publ. No. WO 
94/02595 and Int. Pat. Appl. Publ. No. WO 93/23569, each specifically incorporated 
herein by reference. 

Another means of accumulating high concentrations of a ribozyme(s) 
within cells is to incorporate the ribozyme-encoding sequences into a DNA expression 
vector. Transcription of the ribozyme sequences is driven from a promoter for 
eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase 
III (pol III). Transcripts from pol II or pol III promoters will be expressed at high levels 
in all cells; the levels of a given pol II promoter in a given cell type will depend on the 
nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 
Prokaryotic RNA polymerase promoters may also be used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate cells. Ribozymes 
expressed from such promoters have been shown to function in mammalian cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 
mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 
vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as 
retroviral, semliki forest virus, sindbis virus vectors). 

In another embodiment of the invention, peptide nucleic acid (PNA) 
compositions are provided. PNA is a DNA mimic in which the nucleobases are 
attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 
Dev. 1997 7(4) 431-37). PNA is able to be utilized in a number of methods that 
traditionally have used RNA or DNA. Often PNA sequences perform better in 
techniques than the corresponding RNA or DNA sequences and have utilities that are 
not inherent to RNA or DNA. A review of PNA including methods of making, 
characteristics of, and methods of using, is provided by Corey (Trends Biotechnol. 1997 
Jun;15(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 
that are complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which 
such PNA compositions have been administered. 

PNAs have 2-aminoethyl-glycine linkages replacing the normal 
phosphodiester backbone of DNA (Nielsen et al., Science. 1991 Dec 6;254(5037):1497- 
500; Hanvey et al., Science. 1992 Nov 27;258(5087):1481-5; Hyrup and Nielsen, 
Bioorg Med Chem. 1996 Jan;4(1):5-23). This chemistry has three important 
consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs 
are neutral molecules; secondly, PNAs are achiral, which avoids the need to develop a 
stereoselective synthesis; and thirdly, PNA synthesis uses standard Boc or Fmoc 
protocols for solid-phase peptide synthesis, although other methods, including a 
modified Merrifield method, have been used. 

PNA monomers or ready-made oligomers are commercially available 
from PerSeptive Biosystems (Framingham, MA). PNA syntheses by either Boc or 
Fmoc protocols are straightforward using manual or automated protocols (Norton et al., 
Bioorg Med Chem. 1995 Apr;3(4):437-45). The manual protocol lends itself to the 
production of chemically modified PNAs or the simultaneous synthesis of families of 
closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will 
depend on the properties of the chosen sequence. For example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
can lead to deletions of one or more residues in the product. In expectation of this 
difficulty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefficiently. This should be followed 
by the purification of PNAs by reverse-phase high-pressure liquid chromatography, 
providing yields and purity of product similar to those observed during the synthesis of 
peptides. 
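
As a simple illustration of how stretches of adjacent purines might be located before synthesis, the following Python sketch reports each run of two or more consecutive purines (A or G) in a planned PNA sequence. The function name and the threshold of two residues are assumptions of the sketch; the text does not specify which residues within a run should have their coupling repeated.

```python
PURINES = {"A", "G"}


def purine_runs(sequence, min_run=2):
    """Return (start, end) half-open index pairs for runs of adjacent purines."""
    runs = []
    run_start = None
    # A trailing sentinel closes any run that reaches the end of the sequence.
    for i, base in enumerate(sequence.upper() + "X"):
        if base in PURINES:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= min_run:
                runs.append((run_start, i))
            run_start = None
    return runs
```

The positions returned could then be marked for a repeated coupling step in whatever manual or automated protocol is being used.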

Modifications of PNAs for a given application may be accomplished by 
coupling amino acids during solid-phase synthesis or by attaching compounds that 
contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs 
can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
ease with which PNAs can be modified facilitates optimization for better solubility or 
for specific functional requirements. Once synthesized, the identity of PNAs and their 
derivatives can be confirmed by mass spectrometry. Several studies have made and 
utilized modifications of PNAs (for example, Norton et al., Bioorg Med Chem. 1995 
Apr;3(4):437-45; Petersen et al., J Pept Sci. 1995 May-Jun;1(3):175-83; Orum et al., 
Biotechniques. 1995 Sep;19(3):472-80; Footer et al., Biochemistry. 1996 Aug 
20;35(33):10673-9; Griffith et al., Nucleic Acids Res. 1995 Aug 11;23(15):3003-8; 
Pardridge et al., Proc Natl Acad Sci USA. 1995 Jun 6;92(12):5592-6; Boffa et al., 
Proc Natl Acad Sci USA. 1995 Mar 14;92(6):1901-5; Gambacorti-Passerini et al., 
Blood. 1996 Aug 15;88(4):1411-7; Armitage et al., Proc Natl Acad Sci USA. 1997 
Nov 11;94(23):12320-5; Seeger et al., Biotechniques. 1997 Sep;23(3):512-7). U.S. 
Patent No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 
therapeutics. 

Methods of characterizing the antisense binding properties of PNAs are 
discussed in Rose (Anal Chem. 1993 Dec 15;65(24):3545-9) and Jensen et al. 
(Biochemistry. 1997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
determine binding of PNAs to their complementary oligonucleotide, measuring the 
relative binding kinetics and stoichiometry. Similar types of measurements were made 
by Jensen et al. using BIAcore™ technology. 

Other applications of PNAs that have been described and will be 
apparent to the skilled artisan include use in DNA strand invasion, antisense inhibition, 
mutational analysis, enhancers of transcription, nucleic acid purification, isolation of 
transcriptionally active genes, blocking of transcription factor binding, genome 
cleavage, biosensors, in situ hybridization, and the like. 

Polynucleotide Identification, Characterization and Expression 

Polynucleotide compositions of the present invention may be identified, 
prepared and/or manipulated using any of a variety of well established techniques (see 
generally, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For 
example, a polynucleotide may be identified, as described in more detail below, by 
screening a microarray of cDNAs for tumor-associated expression (i.e., expression that 
is at least two fold greater in a tumor than in normal tissue, as determined using a 
representative assay provided herein). Such screens may be performed, for example, 
using the microarray technology of Affymetrix, Inc. (Santa Clara, CA) according to the 
manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl. 
Acad. Sci. USA 93:10614-10619, 1996 and Heller et al., Proc. Natl. Acad. Sci. USA 
94:2150-2155, 1997). Alternatively, polynucleotides may be amplified from cDNA 
prepared from cells expressing the proteins described herein, such as tumor cells. 
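
The two-fold criterion for tumor-associated expression stated above lends itself to a short worked example. The Python sketch below is illustrative only: the input layout (a mapping from a clone identifier to a pair of tumor and normal signal values), the function name, and the decision to skip clones with no normal-tissue signal are all assumptions, and the sketch does not represent the manufacturer's analysis software.

```python
def tumor_associated(expression, min_fold=2.0):
    """Return {clone: fold} for clones whose tumor/normal ratio is at least min_fold.

    expression maps a clone identifier to (tumor_signal, normal_signal).
    """
    hits = {}
    for clone, (tumor, normal) in expression.items():
        if normal <= 0:
            # Ratio is undefined without a normal-tissue signal; skipped by assumption.
            continue
        fold = tumor / normal
        if fold >= min_fold:
            hits[clone] = fold
    return hits
```

For example, a clone with signals of 10 in tumor and 4 in normal tissue has a ratio of 2.5 and is retained, whereas a clone with signals of 6 and 4 has a ratio of 1.5 and is not.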

Many template dependent processes are available to amplify target 
sequences of interest present in a sample. One of the best known amplification methods 
is the polymerase chain reaction (PCR™) which is described in detail in U.S. Patent 
Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by 
reference in its entirety. Briefly, in PCR™, two primer sequences are prepared which 
are complementary to regions on opposite complementary strands of the target 
sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase (e.g., Taq polymerase). If the target sequence is present 
in a sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by adding on nucleotides. By raising 
and lowering the temperature of the reaction mixture, the extended primers will 
dissociate from the target to form reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably, a reverse 
transcription and PCR™ amplification procedure may be performed in order to quantify 
the amount of mRNA amplified. Polymerase chain reaction methodologies are well 
known in the art. 
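
The relationship between the two primers and the opposite strands of the target can be made concrete with a brief Python sketch. It is a simplified illustration under stated assumptions: the target is supplied as its top strand written 5' to 3', the primers are taken from the extreme ends of that target, and the copy-number estimate assumes a fixed per-cycle efficiency. The function names are hypothetical, and no melting temperature or specificity check is attempted.

```python
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna):
    return "".join(DNA_COMPLEMENT[base] for base in reversed(dna.upper()))


def primer_pair(target_top_strand, length=20):
    """Return (forward, reverse) primers flanking the target, each written 5' to 3'.

    The forward primer matches the 5' end of the top strand, so it anneals to the
    bottom strand; the reverse primer is the reverse complement of the 3' end of
    the top strand, so it anneals to the top strand.
    """
    top = target_top_strand.upper()
    if len(top) < 2 * length:
        raise ValueError("target is too short for two non-overlapping primers")
    return top[:length], reverse_complement(top[-length:])


def copies_after(cycles, starting_copies=1, efficiency=1.0):
    """Idealized copy number after a number of cycles (efficiency 1.0 means doubling)."""
    return starting_copies * (1.0 + efficiency) ** cycles
```

With perfect doubling, ten cycles would give 1024 copies per starting molecule; real reactions fall short of this idealized figure.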

Any of a number of other template dependent processes, many of which 
are variations of the PCR™ amplification technique, are readily known and available in 
the art. Illustratively, some such methods include the ligase chain reaction (referred to 
as LCR), described, for example, in Eur. Pat. Appl. Publ. No. 320,308 and U.S. Patent 
No. 4,883,750; Qbeta Replicase, described in PCT Intl. Pat. Appl. Publ. No. 
PCT/US87/00880; Strand Displacement Amplification (SDA) and Repair Chain 
Reaction (RCR). Still other amplification methods are described in Great Britain Pat. 
Appl. No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025. Other 
nucleic acid amplification procedures include transcription-based amplification systems 
(TAS) (PCT Intl. Pat. Appl. Publ. No. WO 88/10315), including nucleic acid sequence 
based amplification (NASBA) and 3SR. Eur. Pat. Appl. Publ. No. 329,822 describes a 
nucleic acid amplification process involving cyclically synthesizing single-stranded 
RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA). PCT Intl. Pat. Appl. 
Publ. No. WO 89/06700 describes a nucleic acid sequence amplification scheme based 
on the hybridization of a promoter/primer sequence to a target single-stranded DNA 
("ssDNA") followed by transcription of many RNA copies of the sequence. Other 
amplification methods such as "RACE" (Frohman, 1990), and "one-sided PCR" (Ohara, 
1989) are also well-known to those of skill in the art. 

An amplified portion of a polynucleotide of the present invention may be 
used to isolate a full length gene from a suitable library (e.g., a tumor cDNA library) 
using well known techniques. Within such techniques, a library (cDNA or genomic) is 
screened using one or more polynucleotide probes or primers suitable for amplification. 
Preferably, a library is size-selected to include larger molecules. Random primed 
libraries may also be preferred for identifying 5' and upstream regions of genes. 
Genomic libraries are preferred for obtaining introns and extending 5' sequences. 

For hybridization techniques, a partial sequence may be labeled (e.g., by 
nick-translation or end-labeling with ³²P) using well known techniques. A bacterial or 
bacteriophage library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
(see Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 
be analyzed to determine the amount of additional sequence by, for example, PCR using 
a primer from the partial sequence and a primer from the vector. Restriction maps and 
partial sequences may be generated to identify one or more overlapping clones. The 
complete sequence may then be determined using standard techniques, which may 
involve generating a series of deletion clones. The resulting overlapping sequences can 
then be assembled into a single contiguous sequence. A full length cDNA molecule can be 
generated by ligating suitable fragments, using well known techniques. 

Alternatively, amplification techniques, such as those described above, 
can be useful for obtaining a full length coding sequence from a partial cDNA sequence. 
One such amplification technique is inverse PCR (see Triglia et al., Nucl. Acids Res. 
16:8186, 1988), which uses restriction enzymes to generate a fragment in the known 
region of the gene. The fragment is then circularized by intramolecular ligation and 
used as a template for PCR with divergent primers derived from the known region. 
Within an alternative approach, sequences adjacent to a partial sequence may be 
retrieved by amplification with a primer to a linker sequence and a primer specific to a 
known region. The amplified sequences are typically subjected to a second round of 
amplification with the same linker primer and a second primer specific to the known 
region. A variation on this procedure, which employs two primers that initiate 
extension in opposite directions from the known sequence, is described in WO 
96/38591. Another such technique is known as "rapid amplification of cDNA ends" or 
RACE. This technique involves the use of an internal primer and an external primer, 
which hybridizes to a polyA region or vector sequence, to identify sequences that are 5' 
and 3' of a known sequence. Additional techniques include capture PCR (Lagerstrom et 
al., PCR Methods Applic. 1:111-19, 1991) and walking PCR (Parker et al., Nucl. Acids 
Res. 19:3055-60, 1991). Other methods employing amplification may also be employed 
to obtain a full length cDNA sequence. 

In certain instances, it is possible to obtain a full length cDNA sequence 
by analysis of sequences provided in an expressed sequence tag (EST) database, such as 
that available from GenBank. Searches for overlapping ESTs may generally be 
performed using well known programs (e.g., NCBI BLAST searches), and such ESTs 
may be used to generate a contiguous full length sequence. Full length DNA sequences 
may also be obtained by analysis of genomic fragments. 
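
The joining of overlapping sequences into a single contiguous sequence can be illustrated with a minimal Python sketch. It assumes error-free reads on the same strand and requires an exact suffix-to-prefix match of at least `min_overlap` bases; the function name and the default threshold are hypothetical. It stands in for, and is far simpler than, the BLAST-based searches and assembly tools referred to in the text.

```python
def merge_overlapping(left, right, min_overlap=10):
    """Join two sequences where the end of `left` exactly matches the start of `right`.

    Returns the merged sequence using the longest qualifying overlap,
    or None if no overlap of at least min_overlap bases exists.
    """
    left, right = left.upper(), right.upper()
    longest_possible = min(len(left), len(right))
    for k in range(longest_possible, min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    return None
```

Applying the function repeatedly to an ordered series of overlapping fragments would extend the contiguous sequence one fragment at a time.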

In other embodiments of the invention, polynucleotide sequences or 
fragments thereof which encode polypeptides of the invention, or fusion proteins or 
functional equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences that encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and these sequences may 
be used to clone and express a given polypeptide. 

As will be understood by those of skill in the art, it may be advantageous 
in some instances to produce polypeptide-encoding nucleotide sequences possessing 
non-naturally occurring codons. For example, codons preferred by a particular 
prokaryotic or eukaryotic host can be selected to increase the rate of protein expression 
or to produce a recombinant RNA transcript having desirable properties, such as a 
half-life which is longer than that of a transcript generated from the naturally occurring 
sequence. 
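
The degeneracy of the genetic code and the substitution of host-preferred codons can be shown with a short Python sketch. The small codon table covers only the codons used in the example, and the `PREFERRED` mapping is a made-up illustration rather than measured codon usage for any particular host; both, along with the function names, are assumptions of the sketch.

```python
# Subset of the standard genetic code, sufficient for the example below.
CODON_TO_AA = {
    "ATG": "M", "AAA": "K", "AAG": "K", "CTG": "L", "TTA": "L",
    "AGC": "S", "TCT": "S", "GAA": "E", "GAG": "E", "TAA": "*", "TGA": "*",
}

# Hypothetical "preferred" codon per residue; not real host usage data.
PREFERRED = {"M": "ATG", "K": "AAA", "L": "CTG", "S": "AGC", "E": "GAA", "*": "TAA"}


def translate(dna):
    """Translate an in-frame coding sequence using the subset table above."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna, preferred=PREFERRED):
    """Rewrite a coding sequence with the preferred codon for each residue."""
    return "".join(preferred[residue] for residue in translate(dna))
```

The recoded sequence differs from the original at the nucleotide level while `translate` returns the same amino acid sequence for both, which is the property relied upon in the passage above.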

Moreover, the polynucleotide sequences of the present invention can be 
engineered using methods generally known in the art in order to alter polypeptide 
encoding sequences for a variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product. For 
example, DNA shuffling by random fragmentation and PCR reassembly of gene 
fragments and synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences may be ligated to a heterologous sequence to 
encode a fusion protein. For example, to screen peptide libraries for inhibitors of 
polypeptide activity, it may be useful to encode a chimeric protein that can be 
recognized by a commercially available antibody. A fusion protein may also be 
engineered to contain a cleavage site located between the polypeptide-encoding 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. 

Sequences encoding a desired polypeptide may be synthesized, in whole 
or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. 
(1980) Nucl. Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) Nucl. Acids Res. 
Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical 
methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
For example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be 
achieved, for example, using the ABI 431A Peptide Synthesizer (Perkin Elmer, Palo 
Alto, CA). 

A newly synthesized peptide may be substantially purified by preparative 
high performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures 
and Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable 
techniques available in the art. The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
may be altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

In order to express a desired polypeptide, the nucleotide sequences 
encoding the polypeptide, or functional equivalents, may be inserted into an appropriate 
expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
known to those skilled in the art may be used to construct expression vectors containing 
sequences encoding a polypeptide of interest and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. Such techniques 
are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et 
al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York, 
N.Y. 

A variety of expression vector/host systems may be utilized to contain
and express polynucleotide sequences. These include, but are not limited to,
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid,
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors;
insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell
systems transformed with virus expression vectors (e.g., cauliflower mosaic virus,
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or
pBR322 plasmids); or animal cell systems.

The "control elements" or "regulatory sequences" present in an
expression vector are those non-translated regions of the vector--enhancers, promoters,
5' and 3' untranslated regions--which interact with host cellular proteins to carry out
transcription and translation. Such elements may vary in their strength and specificity.
Depending on the vector system and host utilized, any number of suitable transcription
and translation elements, including constitutive and inducible promoters, may be used.
For example, when cloning in bacterial systems, inducible promoters such as the hybrid
lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or
PSPORT1 plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In
mammalian cell systems, promoters from mammalian genes or from mammalian viruses
are generally preferred. If it is necessary to generate a cell line that contains multiple
copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be
advantageously used with an appropriate selectable marker.

In bacterial systems, any of a number of expression vectors may be
selected depending upon the use intended for the expressed polypeptide. For example,
when large quantities are needed, for example for the induction of antibodies, vectors
which direct high level expression of fusion proteins that are readily purified may be
used. Such vectors include, but are not limited to, the multifunctional E. coli cloning
and expression vectors such as BLUESCRIPT (Stratagene), in which the sequence
encoding the polypeptide of interest may be ligated into the vector in frame with
sequences for the amino-terminal Met and the subsequent 7 residues of
beta-galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S.
M. Schuster (1989) J. Biol. Chem. 264:5503-5509); and the like. pGEX vectors
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion
proteins with glutathione S-transferase (GST). In general, such fusion proteins are
soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose
beads followed by elution in the presence of free glutathione. Proteins made in such
systems may be designed to include heparin, thrombin, or factor XA protease cleavage
sites so that the cloned polypeptide of interest can be released from the GST moiety at
will.

In the yeast, Saccharomyces cerevisiae, a number of vectors containing
constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may
be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods
Enzymol. 153:516-544.

In cases where plant expression vectors are used, the expression of
sequences encoding polypeptides may be driven by any of a number of promoters. For
example, viral promoters such as the 35S and 19S promoters of CaMV may be used
alone or in combination with the omega leader sequence from TMV (Takamatsu, N.
(1987) EMBO J. 6:307-311). Alternatively, plant promoters such as the small subunit of
RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J.
3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991)
Results Probl. Cell Differ. 17:85-105). These constructs can be introduced into plant
cells by direct DNA transformation or pathogen-mediated transfection. Such techniques
are described in a number of generally available reviews (see, for example, Hobbs, S. or
Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw
Hill, New York, N.Y.; pp. 191-196).

An insect system may also be used to express a polypeptide of interest.
For example, in one such system, Autographa californica nuclear polyhedrosis virus
(AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a
non-essential region of the virus, such as the polyhedrin gene, and placed under control
of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence
will render the polyhedrin gene inactive and produce recombinant virus lacking coat
protein. The recombinant viruses may then be used to infect, for example, S. frugiperda
cells or Trichoplusia larvae in which the polypeptide of interest may be expressed
(Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. 91:3224-3227).

In mammalian host cells, a number of viral-based expression systems are
generally available. For example, in cases where an adenovirus is used as an expression
vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus
transcription/translation complex consisting of the late promoter and tripartite leader
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used
to obtain a viable virus which is capable of expressing the polypeptide in infected host
cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 81:3655-3659). In addition,
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used
to increase expression in mammalian host cells.

Specific initiation signals may also be used to achieve more efficient
translation of sequences encoding a polypeptide of interest. Such signals include the
ATG initiation codon and adjacent sequences. In cases where sequences encoding the
polypeptide, its initiation codon, and upstream sequences are inserted into the
appropriate expression vector, no additional transcriptional or translational control
signals may be needed. However, in cases where only coding sequence, or a portion
thereof, is inserted, exogenous translational control signals including the ATG initiation
codon should be provided. Furthermore, the initiation codon should be in the correct
reading frame to ensure translation of the entire insert. Exogenous translational
elements and initiation codons may be of various origins, both natural and synthetic.
The efficiency of expression may be enhanced by the inclusion of enhancers which are
appropriate for the particular cell system which is used, such as those described in the
literature (Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162).
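
By way of illustration only, the requirement that an insert carry an ATG initiation
codon in the correct reading frame may be checked computationally before cloning. The
following is a minimal sketch, not a method described herein; the function names
check_insert and add_initiation_codon are hypothetical, and the stop codons are those
of the standard genetic code.

```python
# Hypothetical helper names; stop codons are from the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_insert(coding_seq: str) -> list:
    """Return a list of problems found in a candidate coding insert."""
    seq = coding_seq.upper().replace(" ", "")
    problems = []
    if not seq.startswith("ATG"):
        problems.append("no ATG initiation codon at the 5' end")
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of three")
    # Split into complete codons, reading from the first base.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    # A stop codon before the final codon would truncate translation.
    for index, codon in enumerate(codons[:-1]):
        if codon in STOP_CODONS:
            problems.append(f"premature stop codon {codon} at codon {index + 1}")
            break
    return problems


def add_initiation_codon(coding_seq: str) -> str:
    """Prepend an exogenous ATG when the insert lacks one."""
    seq = coding_seq.upper().replace(" ", "")
    return seq if seq.startswith("ATG") else "ATG" + seq


if __name__ == "__main__":
    fragment = "GCTGCTTAA"  # coding fragment without its own ATG
    print(check_insert(fragment))
    print(check_insert(add_initiation_codon(fragment)))
```

Prepending the ATG in frame, as in the last line, corresponds to the case in which only
a portion of the coding sequence is inserted and the initiation codon is supplied
exogenously.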

In addition, a host cell strain may be chosen for its ability to modulate
the expression of the inserted sequences or to process the expressed protein in the
desired fashion. Such modifications of the polypeptide include, but are not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation.
Post-translational processing which cleaves a "prepro" form of the protein may also be
used to facilitate correct insertion, folding and/or function. Different host cells such as
CHO, COS, HeLa, MDCK, HEK293, and WI38, which have specific cellular machinery
and characteristic mechanisms for such post-translational activities, may be chosen to
ensure the correct modification and processing of the foreign protein.

For long-term, high-yield production of recombinant proteins, stable
expression is generally preferred. For example, cell lines which stably express a
polynucleotide of interest may be transformed using expression vectors which may
contain viral origins of replication and/or endogenous expression elements and a
selectable marker gene on the same or on a separate vector. Following the introduction
of the vector, cells may be allowed to grow for 1-2 days in enriched media before
they are switched to selective media. The purpose of the selectable marker is to confer
resistance to selection, and its presence allows growth and recovery of cells which
successfully express the introduced sequences. Resistant clones of stably transformed
cells may be proliferated using tissue culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed
cell lines. These include, but are not limited to, the herpes simplex virus thymidine
kinase (Wigler, M. et al. (1977) Cell 11:223-32) and adenine phosphoribosyltransferase
(Lowy, I. et al. (1990) Cell 22:817-23) genes which can be employed in tk- or
aprt- cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can
be used as the basis for selection; for example, dhfr, which confers resistance to
methotrexate (Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-70); npt, which
confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et
al. (1981) J. Mol. Biol. 150:1-14); and als or pat, which confer resistance to
chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra).
Additional selectable genes have been described, for example, trpB, which allows cells
to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in
place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. Sci.
85:8047-51). The use of visible markers has gained popularity with such markers as
anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and its substrate
luciferin, being widely used not only to identify transformants, but also to quantify the
amount of transient or stable protein expression attributable to a specific vector system
(Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-131).

Although the presence/absence of marker gene expression suggests that
the gene of interest is also present, its presence and expression may need to be
confirmed. For example, if the sequence encoding a polypeptide is inserted within a
marker gene sequence, recombinant cells containing sequences can be identified by the
absence of marker gene function. Alternatively, a marker gene can be placed in tandem
with a polypeptide-encoding sequence under the control of a single promoter.
Expression of the marker gene in response to induction or selection usually indicates
expression of the tandem gene as well.

Alternatively, host cells that contain and express a desired
polynucleotide sequence may be identified by a variety of procedures known to those of
skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA
hybridizations and protein bioassay or immunoassay techniques which include,
for example, membrane, solution, or chip based technologies for the detection and/or
quantification of nucleic acid or protein.

A variety of protocols for detecting and measuring the expression of
polynucleotide-encoded products, using either polyclonal or monoclonal antibodies
specific for the product, are known in the art. Examples include enzyme-linked
immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal
antibodies reactive to two non-interfering epitopes on a given polypeptide may be
preferred for some applications, but a competitive binding assay may also be employed.
These and other assays are described, among other places, in Hampton, R. et al. (1990;
Serological Methods, a Laboratory Manual, APS Press, St Paul, Minn.) and Maddox, D.
E. et al. (1983; J. Exp. Med. 158:1211-1216).

A wide variety of labels and conjugation techniques are known by those
skilled in the art and may be used in various nucleic acid and amino acid assays. Means
for producing labeled hybridization or PCR probes for detecting sequences related to
polynucleotides include oligolabeling, nick translation, end-labeling or PCR
amplification using a labeled nucleotide. Alternatively, the sequences, or any portions
thereof, may be cloned into a vector for the production of an mRNA probe. Such vectors
are known in the art, are commercially available, and may be used to synthesize RNA
probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6
and labeled nucleotides. These procedures may be conducted using a variety of
commercially available kits. Suitable reporter molecules or labels which may be used
include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents
as well as substrates, cofactors, inhibitors, magnetic particles, and the like.

Host cells transformed with a polynucleotide sequence of interest may be
cultured under conditions suitable for the expression and recovery of the protein from
cell culture. The protein produced by a recombinant cell may be secreted or contained
intracellularly depending on the sequence and/or the vector used. As will be understood
by those of skill in the art, expression vectors containing polynucleotides of the
invention may be designed to contain signal sequences which direct secretion of the
encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other
recombinant constructions may be used to join sequences encoding a polypeptide of
interest to nucleotide sequence encoding a polypeptide domain which will facilitate
purification of soluble proteins. Such purification facilitating domains include, but are
not limited to, metal chelating peptides such as histidine-tryptophan modules that allow
purification on immobilized metals, protein A domains that allow purification on
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity
purification system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker
sequences such as those specific for Factor XA or enterokinase (Invitrogen, San Diego,
Calif.) between the purification domain and the encoded polypeptide may be used to
facilitate purification. One such expression vector provides for expression of a fusion
protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine
residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues
facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as
described in Porath, J. et al. (1992, Prot. Exp. Purif. 3:263-281) while the enterokinase
cleavage site provides a means for purifying the desired polypeptide from the fusion
protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J.
et al. (1993; DNA Cell Biol. 12:441-453).
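
The layout of such a fusion protein can be sketched at the amino acid level. The
following is an illustrative sketch only: the leading Met, the omission of any
thioredoxin domain, and the function names are assumptions, while the enterokinase
recognition sequence Asp-Asp-Asp-Asp-Lys (cleaved after the Lys) is the commonly cited
site for that protease.

```python
# Assumed layout: Met, six histidines, enterokinase site, polypeptide of interest.
HIS_TAG = "H" * 6
ENTEROKINASE_SITE = "DDDDK"  # cleavage occurs after the lysine


def build_fusion(polypeptide: str) -> str:
    """Join the purification tag and cleavable linker to the polypeptide."""
    return "M" + HIS_TAG + ENTEROKINASE_SITE + polypeptide


def cleave_with_enterokinase(fusion: str):
    """Split a fusion protein after the first enterokinase site."""
    start = fusion.find(ENTEROKINASE_SITE)
    if start == -1:
        raise ValueError("no enterokinase cleavage site found")
    cut = start + len(ENTEROKINASE_SITE)
    return fusion[:cut], fusion[cut:]


if __name__ == "__main__":
    fusion = build_fusion("ACDEF")  # "ACDEF" is a placeholder sequence
    tag_part, released = cleave_with_enterokinase(fusion)
    print(fusion, tag_part, released)
```

In this sketch the histidine residues remain with the tag portion after cleavage, so
that the released polypeptide carries no residues from the purification domain.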

In addition to recombinant production methods, polypeptides of the
invention, and fragments thereof, may be produced by direct peptide synthesis using
solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc. 85:2149-2154). Protein
synthesis may be performed using manual techniques or by automation. Automated
synthesis may be achieved, for example, using Applied Biosystems 431A Peptide
Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically
synthesized separately and combined using chemical methods to produce the full length
molecule.

Antibody Compositions, Fragments Thereof and Other Binding Agents

According to another aspect, the present invention further provides
binding agents, such as antibodies and antigen-binding fragments thereof, that exhibit
immunological binding to a tumor polypeptide disclosed herein, or to a portion, variant
or derivative thereof. An antibody, or antigen-binding fragment thereof, is said to
"specifically bind," "immunologically bind," and/or is "immunologically reactive" to a
polypeptide of the invention if it reacts at a detectable level (within, for example, an
ELISA assay) with the polypeptide, and does not react detectably with unrelated
polypeptides under similar conditions.

Immunological binding, as used in this context, generally refers to the
non-covalent interactions of the type which occur between an immunoglobulin
molecule and an antigen for which the immunoglobulin is specific. The strength, or
affinity, of immunological binding interactions can be expressed in terms of the
dissociation constant (Kd) of the interaction, wherein a smaller Kd represents a greater
affinity. Immunological binding properties of selected polypeptides can be quantified
using methods well known in the art. One such method entails measuring the rates of
antigen-binding site/antigen complex formation and dissociation, wherein those rates
depend on the concentrations of the complex partners, the affinity of the interaction, and
on geometric parameters that equally influence the rate in both directions. Thus, both
the "on rate constant" (Kon) and the "off rate constant" (Koff) can be determined by
calculation of the concentrations and the actual rates of association and dissociation.
The ratio of Koff/Kon enables cancellation of all parameters not related to affinity, and is
thus equal to the dissociation constant Kd. See, generally, Davies et al. (1990) Annual
Rev. Biochem. 59:439-473.
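
The relationship Kd = Koff/Kon can be illustrated with a short calculation. The
following is a minimal sketch; the rate constants shown are hypothetical values chosen
for illustration and are not measurements of any binding agent described herein.

```python
def dissociation_constant(k_on: float, k_off: float) -> float:
    """Return Kd = Koff / Kon.

    k_on is in 1/(M*s) and k_off is in 1/s, so the result is in M.
    """
    if k_on <= 0 or k_off < 0:
        raise ValueError("k_on must be positive and k_off non-negative")
    return k_off / k_on


if __name__ == "__main__":
    # Hypothetical rate constants for two binding agents with the same on rate.
    kd_tight = dissociation_constant(k_on=1.0e5, k_off=1.0e-4)
    kd_weak = dissociation_constant(k_on=1.0e5, k_off=1.0e-2)
    # The smaller Kd corresponds to the greater affinity.
    print(kd_tight, kd_weak, kd_tight < kd_weak)
```

With equal on rates, the agent that dissociates more slowly has the smaller Kd and
therefore the greater affinity.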

An "antigen-binding site," or "binding portion" of an antibody refers to
the part of the immunoglobulin molecule that participates in antigen binding. The
antigen binding site is formed by amino acid residues of the N-terminal variable ("V")
regions of the heavy ("H") and light ("L") chains. Three highly divergent stretches
within the V regions of the heavy and light chains are referred to as "hypervariable
regions" which are interposed between more conserved flanking stretches known as
"framework regions," or "FRs". Thus the term "FR" refers to amino acid sequences
which are naturally found between and adjacent to hypervariable regions in
immunoglobulins. In an antibody molecule, the three hypervariable regions of a light
chain and the three hypervariable regions of a heavy chain are disposed relative to each
other in three dimensional space to form an antigen-binding surface. The antigen-binding
surface is complementary to the three-dimensional surface of a bound antigen,
and the three hypervariable regions of each of the heavy and light chains are referred to
as "complementarity-determining regions," or "CDRs."

Binding agents may be further capable of differentiating between patients
with and without a cancer, such as breast cancer, using the representative assays
provided herein. For example, antibodies or other binding agents that bind to a tumor
protein will preferably generate a signal indicating the presence of a cancer in at least
about 20% of patients with the disease, more preferably at least about 30% of patients.
Alternatively, or in addition, the antibody will generate a negative signal indicating the
absence of the disease in at least about 90% of individuals without the cancer. To
determine whether a binding agent satisfies this requirement, biological samples (e.g.,
blood, sera, sputum, urine and/or tumor biopsies) from patients with and without a
cancer (as determined using standard clinical tests) may be assayed as described herein
for the presence of polypeptides that bind to the binding agent. Preferably, a statistically
significant number of samples with and without the disease will be assayed. Each
binding agent should satisfy the above criteria; however, those of ordinary skill in the
art will recognize that binding agents may be used in combination to improve
sensitivity.
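
The criteria above amount to a sensitivity of at least about 20% and a specificity of at
least about 90%, and can be evaluated as follows. This is a minimal sketch under
stated assumptions: the function names are hypothetical, each sample is reduced to a
positive or negative signal, the sample data are invented for illustration, and
combining agents is modeled as scoring a sample positive when any agent is positive.

```python
from typing import Dict, List, Sequence


def evaluate_binding_agent(
    patient_signals: Sequence[bool],
    control_signals: Sequence[bool],
    min_sensitivity: float = 0.20,
    min_specificity: float = 0.90,
) -> Dict[str, object]:
    """Compare one agent's results against the two thresholds."""
    if not patient_signals or not control_signals:
        raise ValueError("both sample groups must be non-empty")
    # Sensitivity: fraction of patients with the disease giving a positive signal.
    sens = sum(bool(s) for s in patient_signals) / len(patient_signals)
    # Specificity: fraction of individuals without the disease giving a negative signal.
    spec = sum(not s for s in control_signals) / len(control_signals)
    return {
        "sensitivity": sens,
        "specificity": spec,
        "meets_sensitivity": sens >= min_sensitivity,
        "meets_specificity": spec >= min_specificity,
    }


def combine_agents(*per_agent_signals: Sequence[bool]) -> List[bool]:
    """Score each sample positive if any agent gives a positive signal."""
    return [any(sample) for sample in zip(*per_agent_signals)]


if __name__ == "__main__":
    # Invented example data: five patients, ten disease-free individuals.
    agent_a = [True, False, False, False, False]
    agent_b = [False, True, False, False, False]
    controls = [False] * 9 + [True]
    print(evaluate_binding_agent(agent_a, controls))
    print(evaluate_binding_agent(combine_agents(agent_a, agent_b), controls))
```

Combining agents in this way can only raise sensitivity; it may also lower specificity,
so the control samples should be re-evaluated for the combination as well.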

Any agent that satisfies the above requirements may be a binding agent.
For example, a binding agent may be a ribosome, with or without a peptide component,
an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an
antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of
a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In
general, antibodies can be produced by cell culture techniques, including the generation
of monoclonal antibodies as described herein, or via transfection of antibody genes into
suitable bacterial or mammalian cell hosts, in order to allow for the production of
recombinant antibodies. In one technique, an immunogen comprising the polypeptide is
initially injected into any of a wide variety of mammals (e.g., mice, rats, rabbits, sheep
or goats). In this step, the polypeptides of this invention may serve as the immunogen
without modification. Alternatively, particularly for relatively short polypeptides, a
superior immune response may be elicited if the polypeptide is joined to a carrier
protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen
is injected into the animal host, preferably according to a predetermined schedule
incorporating one or more booster immunizations, and the animals are bled periodically.
Polyclonal antibodies specific for the polypeptide may then be purified from such
antisera by, for example, affinity chromatography using the polypeptide coupled to a
suitable solid support.

Monoclonal antibodies specific for an antigenic polypeptide of interest
may be prepared, for example, using the technique of Kohler and Milstein, Eur. J.
Immunol. 6:511-519, 1976, and improvements thereto. Briefly, these methods involve
the preparation of immortal cell lines capable of producing antibodies having the
desired specificity (i.e., reactivity with the polypeptide of interest). Such cell lines may
be produced, for example, from spleen cells obtained from an animal immunized as
described above. The spleen cells are then immortalized by, for example, fusion with a
myeloma cell fusion partner, preferably one that is syngeneic with the immunized
animal. A variety of fusion techniques may be employed. For example, the spleen cells
and myeloma cells may be combined with a nonionic detergent for a few minutes and
then plated at low density on a selective medium that supports the growth of hybrid
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine,
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks,
colonies of hybrids are observed. Single colonies are selected and their culture
supernatants tested for binding activity against the polypeptide. Hybridomas having
high reactivity and specificity are preferred.

Monoclonal antibodies may be isolated from the supernatants of growing
hybridoma colonies. In addition, various techniques may be employed to enhance the
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from
the ascites fluid or the blood. Contaminants may be removed from the antibodies by
conventional techniques, such as chromatography, gel filtration, precipitation, and
extraction. The polypeptides of this invention may be used in the purification process
in, for example, an affinity chromatography step.

A number of therapeutically useful molecules are known in the art which
comprise antigen-binding sites that are capable of exhibiting immunological binding
properties of an antibody molecule. The proteolytic enzyme papain preferentially
cleaves IgG molecules to yield several fragments, two of which (the "F(ab)" fragments)
each comprise a covalent heterodimer that includes an intact antigen-binding site. The
enzyme pepsin is able to cleave IgG molecules to provide several fragments, including
the "F(ab')2" fragment which comprises both antigen-binding sites. An "Fv" fragment
can be produced by preferential proteolytic cleavage of an IgM, and on rare occasions
IgG or IgA immunoglobulin molecule. Fv fragments are, however, more commonly
derived using recombinant techniques known in the art. The Fv fragment includes a
non-covalent VH-VL heterodimer including an antigen-binding site which retains much
of the antigen recognition and binding capabilities of the native antibody molecule.
Inbar et al. (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976)
Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096.

A single chain Fv ("sFv") polypeptide is a covalently linked VH::VL
heterodimer which is expressed from a gene fusion including VH- and VL-encoding
genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci.
USA 85(16):5879-5883. A number of methods have been described to discern chemical
structures for converting the naturally aggregated--but chemically separated--light and
heavy polypeptide chains from an antibody V region into an sFv molecule which will
fold into a three dimensional structure substantially similar to the structure of an
antigen-binding site. See, e.g., U.S. Pat. Nos. 5,091,513 and 5,132,405, to Huston et al.;
and U.S. Pat. No. 4,946,778, to Ladner et al.

Each of the above-described molecules includes a heavy chain and a
light chain CDR set, respectively interposed between a heavy chain and a light chain FR
set which provide support to the CDRs and define the spatial relationship of the CDRs
relative to each other. As used herein, the term "CDR set" refers to the three
hypervariable regions of a heavy or light chain V region. Proceeding from the
N-terminus of a heavy or light chain, these regions are denoted as "CDR1," "CDR2," and
"CDR3" respectively. An antigen-binding site, therefore, includes six CDRs,
comprising the CDR set from each of a heavy and a light chain V region. A polypeptide
comprising a single CDR (e.g., a CDR1, CDR2 or CDR3) is referred to herein as a
"molecular recognition unit." Crystallographic analysis of a number of antigen-antibody
complexes has demonstrated that the amino acid residues of CDRs form extensive
contact with bound antigen, wherein the most extensive antigen contact is with the
heavy chain CDR3. Thus, the molecular recognition units are primarily responsible for
the specificity of an antigen-binding site.

As used herein, the term "FR set" refers to the four flanking amino acid
sequences which frame the CDRs of a CDR set of a heavy or light chain V region.
Some FR residues may contact bound antigen; however, FRs are primarily responsible
for folding the V region into the antigen-binding site, particularly the FR residues
directly adjacent to the CDRs. Within FRs, certain amino residues and certain structural
features are very highly conserved. In this regard, all V region sequences contain an
internal disulfide loop of around 90 amino acid residues. When the V regions fold into a
binding-site, the CDRs are displayed as projecting loop motifs which form an
antigen-binding surface. It is generally recognized that there are conserved structural regions of
FRs which influence the folded shape of the CDR loops into certain "canonical"
structures--regardless of the precise CDR amino acid sequence. Further, certain FR
residues are known to participate in non-covalent interdomain contacts which stabilize
the interaction of the antibody heavy and light chains.

A number of "humanized" antibody molecules comprising an
antigen-binding site derived from a non-human immunoglobulin have been described, including
chimeric antibodies having rodent V regions and their associated CDRs fused to human
constant domains (Winter et al. (1991) Nature 349:293-299; Lobuglio et al. (1989)
Proc. Nat. Acad. Sci. USA 86:4220-4224; Shaw et al. (1987) J. Immunol. 138:4534-4538;
and Brown et al. (1987) Cancer Res. 47:3577-3583), rodent CDRs grafted into a
human supporting FR prior to fusion with an appropriate human antibody constant
domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science
239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent CDRs
supported by recombinantly veneered rodent FRs (European Patent Publication No.
519,596, published Dec. 23, 1992). These "humanized" molecules are designed to
minimize unwanted immunological response toward rodent antihuman antibody
molecules which limits the duration and effectiveness of therapeutic applications of
those moieties in human recipients.

As used herein, the terms "veneered FRs" and "recombinantly veneered
FRs" refer to the selective replacement of FR residues from, e.g., a rodent heavy or light
chain V region, with human FR residues in order to provide a xenogeneic molecule
comprising an antigen-binding site which retains substantially all of the native FR
polypeptide folding structure. Veneering techniques are based on the understanding that
the ligand binding characteristics of an antigen-binding site are determined primarily by
the structure and relative disposition of the heavy and light chain CDR sets within the
antigen-binding surface. Davies et al. (1990) Ann. Rev. Biochem. 59:439-473. Thus,
antigen binding specificity can be preserved in a humanized antibody only wherein the
CDR structures, their interaction with each other, and their interaction with the rest of
the V region domains are carefully maintained. By using veneering techniques, exterior
(e.g., solvent-accessible) FR residues which are readily encountered by the immune
system are selectively replaced with human residues to provide a hybrid molecule that
comprises either a weakly immunogenic, or substantially non-immunogenic veneered
surface.

The process of veneering makes use of the available sequence data for
human antibody variable domains compiled by Kabat et al., in Sequences of Proteins of
Immunological Interest, 4th ed., (U.S. Dept. of Health and Human Services, U.S.
Government Printing Office, 1987), updates to the Kabat database, and other accessible
U.S. and foreign databases (both nucleic acid and protein). Solvent accessibilities of V
region amino acids can be deduced from the known three-dimensional structure for
human and murine antibody fragments. There are two general steps in veneering a
murine antigen-binding site. Initially, the FRs of the variable domains of an antibody
molecule of interest are compared with corresponding FR sequences of human variable
domains obtained from the above-identified sources. The most homologous human V
regions are then compared residue by residue to corresponding murine amino acids. The
residues in the murine FR which differ from the human counterpart are replaced by the
residues present in the human moiety using recombinant techniques well known in the
art. Residue switching is only carried out with moieties which are at least partially
exposed (solvent accessible), and care is exercised in the replacement of amino acid
residues which may have a significant effect on the tertiary structure of V region
domains, such as proline, glycine and charged amino acids.
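
The residue-by-residue comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used in practice: the FrameworkPosition record, the veneer function and the CAUTION_RESIDUES set are hypothetical names, the solvent-exposure flags are assumed to have been derived beforehand from a known three-dimensional structure, and histidine is arbitrarily treated as uncharged.

```python
from dataclasses import dataclass

# Residues the text singles out for care: proline, glycine and charged
# amino acids (D, E, K, R here; treating H as uncharged is an assumption).
CAUTION_RESIDUES = set("PG") | set("DEKR")


@dataclass
class FrameworkPosition:
    murine: str    # one-letter code of the murine FR residue
    human: str     # residue at the same position in the most homologous human FR
    exposed: bool  # True if the position is at least partially solvent accessible


def veneer(positions):
    """Return (veneered FR sequence, indices of replacements needing review)."""
    veneered, flagged = [], []
    for index, pos in enumerate(positions):
        if pos.exposed and pos.murine != pos.human:
            # Exposed and different from human: switch to the human residue.
            veneered.append(pos.human)
            if pos.murine in CAUTION_RESIDUES or pos.human in CAUTION_RESIDUES:
                flagged.append(index)
        else:
            # Buried or already identical: keep the murine residue.
            veneered.append(pos.murine)
    return "".join(veneered), flagged
```

Positions returned in the second element are those where the swap involves proline, glycine or a charged residue, so a designer would inspect them before committing to the replacement.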

In this manner, the resultant "veneered" murine antigen-binding sites are
thus designed to retain the murine CDR residues, the residues substantially adjacent to
the CDRs, the residues identified as buried or mostly buried (solvent inaccessible), the
residues believed to participate in non-covalent (e.g., electrostatic and hydrophobic)
contacts between heavy and light chain domains, and the residues from conserved
structural regions of the FRs which are believed to influence the "canonical" tertiary
structures of the CDR loops. These design criteria are then used to prepare recombinant
nucleotide sequences which combine the CDRs of both the heavy and light chain of a
murine antigen-binding site into human-appearing FRs that can be used to transfect
mammalian cells for the expression of recombinant human antibodies which exhibit the
antigen specificity of the murine antibody molecule.

In another embodiment of the invention, monoclonal antibodies of the
present invention may be coupled to one or more therapeutic agents. Suitable agents in
this regard include radionuclides, differentiation inducers, drugs, toxins, and derivatives
thereof. Preferred radionuclides include ^°Y, '^^I, ^^^I, ^^^e, ^*^Re, ^^^At, and
^^^Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs.
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred
toxins include ricin, abrin, diphtheria toxin, cholera toxin, gelonin, Pseudomonas
exotoxin, Shigella toxin, and pokeweed antiviral protein.

A therapeutic agent may be coupled (e.g., covalently bonded) to a 
suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A 
direct reaction between an agent and an antibody is possible when each possesses a
substituent capable of reacting with the other. For example, a nucleophilic group, such
as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl- 
containing group, such as an anhydride or an acid halide, or with an alkyl group 
containing a good leaving group (e.g., a halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an
antibody via a linker group. A linker group can function as a spacer to distance an
antibody from an agent in order to avoid interference with binding capabilities. A linker
group can also serve to increase the chemical reactivity of a substituent on an agent or
an antibody, and thus increase the coupling efficiency. An increase in chemical
reactivity may also facilitate the use of agents, or functional groups on agents, which
otherwise would not be possible.

It will be evident to those skilled in the art that a variety of bifunctional
or polyfunctional reagents, both homo- and hetero-functional (such as those described in
the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker
group. Coupling may be effected, for example, through amino groups, carboxyl groups,
sulfhydryl groups or oxidized carbohydrate residues. There are numerous references
describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al.

Where a therapeutic agent is more potent when free from the antibody
portion of the immunoconjugates of the present invention, it may be desirable to use a
linker group which is cleavable during or upon internalization into a cell. A number of
different cleavable linker groups have been described. The mechanisms for the
intracellular release of an agent from these linker groups include cleavage by reduction
of a disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a
photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter et al.), by hydrolysis of
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by
serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.).

It may be desirable to couple more than one agent to an antibody. In one 
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agent
may be prepared in a variety of ways. For example, more than one agent may be
coupled directly to an antibody molecule, or linkers that provide multiple sites for 
attachment can be used. Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent
bonding either directly or via a linker group. Suitable carriers include proteins such as
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides
such as aminodextran (e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may
also bear an agent by noncovalent bonding or by encapsulation, such as within a
liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for
radionuclide agents include radiohalogenated small molecules and chelating
compounds. For example, U.S. Patent No. 4,735,792 discloses representative
radiohalogenated small molecules and their synthesis. A radionuclide chelate may be
formed from chelating compounds that include those containing nitrogen and sulfur
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For
example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating
compounds and their synthesis.

T Cell Compositions

The present invention, in another aspect, provides T cells specific for a
tumor polypeptide disclosed herein, or for a variant or derivative thereof. Such cells
may generally be prepared in vitro or ex vivo, using standard procedures. For example,
T cells may be isolated from bone marrow, peripheral blood, or a fraction of bone
marrow or peripheral blood of a patient, using a commercially available cell separation
system, such as the Isolex™ System, available from Nexell Therapeutics, Inc. (Irvine,
CA; see also U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO
91/16116 and WO 92/07243). Alternatively, T cells may be derived from related or
unrelated humans, non-human mammals, cell lines or cultures.

T cells may be stimulated with a polypeptide, polynucleotide encoding a
polypeptide and/or an antigen presenting cell (APC) that expresses such a polypeptide.
Such stimulation is performed under conditions and for a time sufficient to permit the
generation of T cells that are specific for the polypeptide of interest. Preferably, a tumor
polypeptide or polynucleotide of the invention is present within a delivery vehicle, such
as a microsphere, to facilitate the generation of specific T cells.

T cells are considered to be specific for a polypeptide of the present
invention if the T cells specifically proliferate, secrete cytokines or kill target cells
coated with the polypeptide or expressing a gene encoding the polypeptide. T cell
specificity may be evaluated using any of a variety of standard techniques. For
example, within a chromium release assay or proliferation assay, a stimulation index of
more than two fold increase in lysis and/or proliferation, compared to negative controls,
indicates T cell specificity. Such assays may be performed, for example, as described in
Chen et al., Cancer Res. 54:1065-1070, 1994. Alternatively, detection of the
proliferation of T cells may be accomplished by a variety of known techniques. For
example, T cell proliferation can be detected by measuring an increased rate of DNA
synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and
measuring the amount of tritiated thymidine incorporated into DNA). Contact with a
tumor polypeptide (100 ng/ml - 100 µg/ml, preferably 200 ng/ml - 25 µg/ml) for 3 - 7
days will typically result in at least a two fold increase in proliferation of the T cells.
Contact as described above for 2-3 hours should result in activation of the T cells, as
measured using standard cytokine assays in which a two fold increase in the level of
cytokine release (e.g., TNF or IFN-γ) is indicative of T cell activation (see Coligan et
al., Current Protocols in Immunology, vol. 1, Wiley Interscience (Greene 1998)). T
cells that have been activated in response to a tumor polypeptide, polynucleotide or
polypeptide-expressing APC may be CD4+ and/or CD8+. Tumor polypeptide-specific T
cells may be expanded using standard techniques. Within preferred embodiments, the T
cells are derived from a patient, a related donor or an unrelated donor, and are
administered to the patient following stimulation and expansion.
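
The two-fold criterion above amounts to a simple ratio test. The following Python sketch is illustrative only: the function names and the readout values are hypothetical, and the readout may be any consistent measure (e.g., percent specific lysis or counts per minute of incorporated tritiated thymidine) taken for both the stimulated sample and the negative control.

```python
def stimulation_index(test_value, control_value):
    """Ratio of the stimulated readout to the negative-control readout."""
    if control_value <= 0:
        raise ValueError("negative-control value must be positive")
    return test_value / control_value


def is_specific(test_value, control_value, threshold=2.0):
    """True when the stimulation index exceeds the two-fold threshold."""
    return stimulation_index(test_value, control_value) > threshold
```

For example, a stimulated culture reading 9000 cpm against a control of 3000 cpm gives an index of 3 and would be scored as specific, whereas 5000 cpm against the same control would not.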

For therapeutic purposes, CD4+ or CD8+ T cells that proliferate in
response to a tumor polypeptide, polynucleotide or APC can be expanded in number
either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a
variety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or a
short peptide corresponding to an immunogenic portion of such a polypeptide, with or
without the addition of T cell growth factors, such as interleukin-2, and/or stimulator
cells that synthesize a tumor polypeptide. Alternatively, one or more T cells that
proliferate in the presence of the tumor polypeptide can be expanded in number by
cloning. Methods for cloning cells are well known in the art, and include limiting
dilution.

Pharmaceutical Compositions 

In additional embodiments, the present invention concerns formulation
of one or more of the polynucleotide, polypeptide, T-cell and/or antibody compositions
disclosed herein in pharmaceutically-acceptable carriers for administration to a cell or 
an animal, either alone, or in combination with one or more other modalities of therapy. 

It will be understood that, if desired, a composition as disclosed herein
may be administered in combination with other agents as well, such as, e.g., other
proteins or polypeptides or various pharmaceutically-active agents. In fact, there is
virtually no limit to other components that may also be included, given that the
additional agents do not cause a significant adverse effect upon contact with the target
cells or host tissues. The compositions may thus be delivered along with various other
agents as required in the particular instance. Such compositions may be purified from
host cells or other biological sources, or alternatively may be chemically synthesized as
described herein. Likewise, such compositions may further comprise substituted or
derivatized RNA or DNA compositions.

Therefore, in another aspect of the present invention, pharmaceutical
compositions are provided comprising one or more of the polynucleotide, polypeptide,
antibody, and/or T-cell compositions described herein in combination with a
physiologically acceptable carrier. In certain preferred embodiments, the
pharmaceutical compositions of the invention comprise immunogenic polynucleotide
and/or polypeptide compositions of the invention for use in prophylactic and therapeutic
vaccine applications. Vaccine preparation is generally described in, for example, M.F.
Powell and M.J. Newman, eds., "Vaccine Design (the subunit and adjuvant approach),"
Plenum Press (NY, 1995). Generally, such compositions will comprise one or more
polynucleotide and/or polypeptide compositions of the present invention in combination
with one or more immunostimulants.

It will be apparent that any of the pharmaceutical compositions described
herein can contain pharmaceutically acceptable salts of the polynucleotides and
polypeptides of the invention. Such salts can be prepared, for example, from
pharmaceutically acceptable non-toxic bases, including organic bases (e.g., salts of
primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e.g.,
sodium, potassium, lithium, ammonium, calcium and magnesium salts).

In another embodiment, illustrative immunogenic compositions, e.g.,
vaccine compositions, of the present invention comprise DNA encoding one or more of
the polypeptides as described above, such that the polypeptide is generated in situ. As
noted above, the polynucleotide may be administered within any of a variety of delivery
systems known to those of ordinary skill in the art. Indeed, numerous gene delivery
techniques are well known in the art, such as those described by Rolland, Crit. Rev.
Therap. Drug Carrier Systems 15:143-198, 1998, and references cited therein.
Appropriate polynucleotide expression systems will, of course, contain the necessary
DNA regulatory sequences for expression in a patient (such as a suitable
promoter and terminating signal). Alternatively, bacterial delivery systems may involve
the administration of a bacterium (such as Bacillus Calmette-Guerin) that expresses an
immunogenic portion of the polypeptide on its cell surface or secretes such an epitope.

Therefore, in certain embodiments, polynucleotides encoding
immunogenic polypeptides described herein are introduced into suitable mammalian
host cells for expression using any of a number of known viral-based systems. In one
illustrative embodiment, retroviruses provide a convenient and effective platform for
gene delivery systems. A selected nucleotide sequence encoding a polypeptide of the
present invention can be inserted into a vector and packaged in retroviral particles using
techniques known in the art. The recombinant virus can then be isolated and delivered
to a subject. A number of illustrative retroviral systems have been described (e.g., U.S.
Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980-990; Miller, A. D.
(1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Burns
et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin
(1993) Cur. Opin. Genet. Develop. 3:102-109).

In addition, a number of illustrative adenovirus-based systems have also
been described. Unlike retroviruses which integrate into the host genome, adenoviruses
persist extrachromosomally thus minimizing the risks associated with insertional
mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J.
Virol. 67:5911-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et
al. (1994) J. Virol. 68:933-940; Barr et al. (1994) Gene Therapy 1:51-58; Berkner, K. L.
(1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461-476).

Various adeno-associated virus (AAV) vector systems have also been
developed for polynucleotide delivery. AAV vectors can be readily constructed using
techniques well known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941;
International Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al.
(1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring
Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology
3:533-539; Muzyczka, N. (1992) Current Topics in Microbiol. and Immunol. 158:97-129;
Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene
Therapy 1:165-169; and Zhou et al. (1994) J. Exp. Med. 179:1867-1875.

Additional viral vectors useful for delivering the polynucleotides
encoding polypeptides of the present invention by gene transfer include those derived
from the pox family of viruses, such as vaccinia virus and avian poxvirus. By way of
example, vaccinia virus recombinants expressing the novel molecules can be
constructed as follows. The DNA encoding a polypeptide is first inserted into an
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is
then used to transfect cells which are simultaneously infected with vaccinia.
Homologous recombination serves to insert the vaccinia promoter plus the gene
encoding the polypeptide of interest into the viral genome. The resulting TK(-)
recombinant can be selected by culturing the cells in the presence of
5-bromodeoxyuridine and picking viral plaques resistant thereto.

A vaccinia-based infection/transfection system can be conveniently used
to provide for inducible, transient expression or coexpression of one or more
polypeptides described herein in host cells of an organism. In this particular system,
cells are first infected in vitro with a vaccinia virus recombinant that encodes the
bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in
that it only transcribes templates bearing T7 promoters. Following infection, cells are
transfected with the polynucleotide or polynucleotides of interest, driven by a T7
promoter. The polymerase expressed in the cytoplasm from the vaccinia virus
recombinant transcribes the transfected DNA into RNA which is then translated into
polypeptide by the host translational machinery. The method provides for high level,
transient, cytoplasmic production of large quantities of RNA and its translation
products. See, e.g., Elroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990)
87:6743-6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126.

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses,
can also be used to deliver the coding sequences of interest. Recombinant avipox
viruses, expressing immunogens from mammalian pathogens, are known to confer
protective immunity when administered to non-avian species. The use of an Avipox
vector is particularly desirable in human and other mammalian species since members
of the Avipox genus can only productively replicate in susceptible avian species and
therefore are not infective in mammalian cells. Methods for producing recombinant
Avipoxviruses are known in the art and employ genetic recombination, as described
above with respect to the production of vaccinia viruses. See, e.g., WO 91/12882; WO
89/03429; and WO 92/03545.

Any of a number of alphavirus vectors can also be used for delivery of
polynucleotide compositions of the present invention, such as those vectors described in
U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based 
on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 

Moreover, molecular conjugate vectors, such as the adenovirus chimeric
vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et
al. Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery
under the invention.

Additional illustrative information on these and other known viral-based
delivery systems can be found, for example, in Fisher-Hoch et al., Proc. Natl. Acad. Sci.
USA 86:317-321, 1989; Flexner et al., Ann. N.Y. Acad. Sci. 569:86-103, 1989; Flexner
et al., Vaccine 8:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487;
WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242;
WO 91/02805; Berkner, Biotechniques 6:616-627, 1988; Rosenfeld et al., Science
252:431-434, 1991; Kolls et al., Proc. Natl. Acad. Sci. USA 91:215-219, 1994;
Kass-Eisler et al., Proc. Natl. Acad. Sci. USA 90:11498-11502, 1993; Guzman et al.,
Circulation 88:2838-2848, 1993; and Guzman et al., Cir. Res. 73:1202-1207, 1993.

In certain embodiments, a polynucleotide may be integrated into the
genome of a target cell. This integration may be in the specific location and orientation
via homologous recombination (gene replacement) or it may be integrated in a random,
non-specific location (gene augmentation). In yet further embodiments, the
polynucleotide may be stably maintained in the cell as a separate, episomal segment of
DNA. Such polynucleotide segments or "episomes" encode sequences sufficient to
permit maintenance and replication independent of or in synchronization with the host
cell cycle. The manner in which the expression construct is delivered to a cell and
where in the cell the polynucleotide remains is dependent on the type of expression
construct employed.

In another embodiment of the invention, a polynucleotide is
administered/delivered as "naked" DNA, for example as described in Ulmer et al.,
Science 259:1745-1749, 1993 and reviewed by Cohen, Science 259:1691-1692, 1993.
The uptake of naked DNA may be increased by coating the DNA onto biodegradable
beads, which are efficiently transported into the cells.

In still another embodiment, a composition of the present invention can
be delivered via a particle bombardment approach, many of which have been described.
In one illustrative example, gas-driven particle acceleration can be achieved with
devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK)
and Powderject Vaccines Inc. (Madison, WI), some examples of which are described in
U.S. Patent Nos. 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500
799. This approach offers needle-free delivery wherein a dry powder
formulation of microscopic particles, such as polynucleotide or polypeptide particles,
is accelerated to high speed within a helium gas jet generated by a hand held device,
propelling the particles into a target tissue of interest.

In a related embodiment, other devices and methods that may be useful
for gas-driven needle-less injection of compositions of the present invention include
those provided by Bioject, Inc. (Portland, OR), some examples of which are described
in U.S. Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639
and 5,993,412.

According to another embodiment, the pharmaceutical compositions
described herein will comprise one or more immunostimulants in addition to the
immunogenic polynucleotide, polypeptide, antibody, T-cell and/or APC compositions
of this invention. An immunostimulant refers to essentially any substance that enhances
or potentiates an immune response (antibody and/or cell-mediated) to an exogenous
antigen. One preferred type of immunostimulant comprises an adjuvant. Many
adjuvants contain a substance designed to protect the antigen from rapid catabolism,
such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such
as lipid A, Bordetella pertussis or Mycobacterium tuberculosis derived proteins.
Certain adjuvants are commercially available as, for example, Freund's Incomplete
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant
65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia,
PA); aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate;
salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated
sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes;
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as
GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be used as
adjuvants.

Within certain embodiments of the invention, the adjuvant composition
is preferably one that induces an immune response predominantly of the Th1 type. High
levels of Th1-type cytokines (e.g., IFN-γ, TNFα, IL-2 and IL-12) tend to favor the
induction of cell mediated immune responses to an administered antigen. In contrast,
high levels of Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the
induction of humoral immune responses. Following application of a vaccine as
provided herein, a patient will support an immune response that includes Th1- and
Th2-type responses. Within a preferred embodiment, in which a response is predominantly
Th1-type, the level of Th1-type cytokines will increase to a greater extent than the level
of Th2-type cytokines. The levels of these cytokines may be readily assessed using
standard assays. For a review of the families of cytokines, see Mosmann and Coffman,
Ann. Rev. Immunol. 7:145-173, 1989.
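
The comparison of Th1-type and Th2-type cytokine increases can be illustrated with a short Python sketch. It rests on assumptions that the text does not make: cytokine levels are taken to be measured before and after vaccination in the same units, and the mean fold increase across each panel is used as the summary statistic. The function and dictionary names are hypothetical.

```python
TH1 = ("IFN-gamma", "TNF-alpha", "IL-2", "IL-12")
TH2 = ("IL-4", "IL-5", "IL-6", "IL-10")


def mean_fold_increase(before, after, panel):
    """Average after/before ratio over the panel cytokines that were measured."""
    ratios = [after[c] / before[c] for c in panel if c in before and c in after]
    if not ratios:
        raise ValueError("no cytokines from this panel were measured")
    return sum(ratios) / len(ratios)


def predominantly_th1(before, after):
    """True when Th1-type cytokines rise more than Th2-type cytokines."""
    return mean_fold_increase(before, after, TH1) > mean_fold_increase(before, after, TH2)
```

Other summaries (for instance, comparing a single representative cytokine from each panel) would serve equally well; the mean is chosen here only for brevity.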

Certain preferred adjuvants for eliciting a predominantly Th1-type
response include, for example, a combination of monophosphoryl lipid A, preferably
3-de-O-acylated monophosphoryl lipid A, together with an aluminum salt. MPL®
adjuvants are available from Corixa Corporation (Seattle, WA; see, for example, US
Patent Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing
oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a
predominantly Th1 response. Such oligonucleotides are well known and are described,
for example, in WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and
5,856,462. Immunostimulatory DNA sequences are also described, for example, by
Sato et al., Science 273:352, 1996. Another preferred adjuvant comprises a saponin,
such as Quil A, or derivatives thereof, including QS21 and QS7 (Aquila
Biopharmaceuticals Inc., Framingham, MA); Escin; Digitonin; or Gypsophila or
Chenopodium quinoa saponins. Other preferred formulations include more than one
saponin in the adjuvant combinations of the present invention, for example
combinations of at least two of the following group comprising QS21, QS7, Quil A,
β-escin, or digitonin.
Alternatively the saponin formulations may be combined with vaccine
vehicles composed of chitosan or other polycationic polymers, polylactide and
polylactide-co-glycolide particles, poly-N-acetyl glucosamine-based polymer matrix,
particles composed of polysaccharides or chemically modified polysaccharides,
liposomes and lipid-based particles, particles composed of glycerol monoesters, etc. The
saponins may also be formulated in the presence of cholesterol to form particulate
structures such as liposomes or ISCOMs. Furthermore, the saponins may be formulated
together with a polyoxyethylene ether or ester, in either a non-particulate solution or
suspension, or in a particulate structure such as a paucilamellar liposome or ISCOM. The
saponins may also be formulated with excipients such as Carbopol® to increase
viscosity, or may be formulated in a dry powder form with a powder excipient such as
lactose.

In one preferred embodiment, the adjuvant system includes the
combination of a monophosphoryl lipid A and a saponin derivative, such as the
combination of QS21 and 3D-MPL® adjuvant, as described in WO 94/00153, or a less
reactogenic composition where the QS21 is quenched with cholesterol, as described in
WO 96/33739. Other preferred formulations comprise an oil-in-water emulsion and
tocopherol. Another particularly preferred adjuvant formulation employing QS21,
3D-MPL® adjuvant and tocopherol in an oil-in-water emulsion is described in WO
95/17210.

Another enhanced adjuvant system involves the combination of a
CpG-containing oligonucleotide and a saponin derivative; in particular, the combination of
CpG and QS21 is disclosed in WO 00/09159. Preferably the formulation additionally
comprises an oil in water emulsion and tocopherol.

Additional illustrative adjuvants for use in the pharmaceutical
compositions of the invention include Montanide ISA 720 (Seppic, France), SAF
(Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series
of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart,
Belgium), Detox (Enhanzyn®) (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton,
MT) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described
in pending U.S. Patent Application Serial Nos. 08/853,826 and 09/074,720, the
disclosures of which are incorporated herein by reference in their entireties, and
polyoxyethylene ether adjuvants such as those described in WO 99/52549A1.

Other preferred adjuvants include adjuvant molecules of the general
formula

(I): HO(CH2CH2O)n-A-R,
wherein n is 1-50, A is a bond or -C(O)-, R is C1-50 alkyl or Phenyl C1-50 alkyl.

One embodiment of the present invention consists of a vaccine
formulation comprising a polyoxyethylene ether of general formula (I), wherein n is
between 1 and 50, preferably 4-24, most preferably 9; the R component is C1-50,
preferably C4-C20 alkyl and most preferably C12 alkyl, and A is a bond. The
concentration of the polyoxyethylene ethers should be in the range 0.1-20%, preferably
from 0.1-10%, and most preferably in the range 0.1-1%. Preferred polyoxyethylene
ethers are selected from the following group: polyoxyethylene-9-lauryl ether,
polyoxyethylene-9-stearyl ether, polyoxyethylene-8-stearyl ether, polyoxyethylene-4-lauryl
ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl ether.
Polyoxyethylene ethers such as polyoxyethylene lauryl ether are described in the Merck
index (12th edition: entry 7717). These adjuvant molecules are described in WO
99/52549.
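
The nested ranges recited above for n, the alkyl chain length and the concentration can be made concrete with a short Python sketch. It is an illustration only: the function names and tier labels are hypothetical, and the sketch covers just the case in which A is a bond and R is a straight alkyl group.

```python
def tier(value, allowed, preferred, most_preferred):
    """Return the narrowest recited range that contains value."""
    for label, (low, high) in (
        ("most preferred", most_preferred),
        ("preferred", preferred),
        ("allowed", allowed),
    ):
        if low <= value <= high:
            return label
    return "outside range"


def classify_ether(n, alkyl_carbons, concentration_percent):
    """Classify a polyoxyethylene ether formulation against the recited ranges."""
    return {
        "n": tier(n, (1, 50), (4, 24), (9, 9)),
        "alkyl": tier(alkyl_carbons, (1, 50), (4, 20), (12, 12)),
        "concentration": tier(concentration_percent, (0.1, 20), (0.1, 10), (0.1, 1)),
    }
```

For instance, polyoxyethylene-9-lauryl ether (n = 9, with lauryl being a C12 alkyl) at 0.5% falls in the most preferred tier on all three counts, while polyoxyethylene-23-lauryl ether at 5% is most preferred only in its alkyl chain length.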

The polyoxyethylene ether according to the general formula (I) above
may, if desired, be combined with another adjuvant. For example, a preferred adjuvant
combination is with CpG as described in the pending UK patent application
GB 9820956.2.

According to another embodiment of this invention, an immunogenic
composition described herein is delivered to a host via antigen presenting cells (APCs),
such as dendritic cells, macrophages, B cells, monocytes and other cells that may be
engineered to be efficient APCs. Such cells may, but need not, be genetically modified
to increase the capacity for presenting the antigen, to improve activation and/or
maintenance of the T cell response, to have anti-tumor effects per se and/or to be
immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs
may generally be isolated from any of a variety of biological fluids and organs,
including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic
or xenogeneic cells.

Certain preferred embodiments of the present invention use dendritic
cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent
APCs (Banchereau and Steinman, Nature 392:245-251, 1998) and have been shown to
be effective as a physiological adjuvant for eliciting prophylactic or therapeutic
antitumor immunity (see Timmerman and Levy, Ann. Rev. Med. 50:507-529, 1999). In
general, dendritic cells may be identified based on their typical shape (stellate in situ,
with marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up,
process and present antigens with high efficiency and their ability to activate naive T
cell responses. Dendritic cells may, of course, be engineered to express specific
cell-surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex
vivo, and such modified dendritic cells are contemplated by the present invention. As
an alternative to dendritic cells, vesicles secreted by antigen-loaded dendritic cells (called
exosomes) may be used within a vaccine (see Zitvogel et al., Nature Med. 4:594-600,
1998).

Dendritic cells and progenitors may be obtained from peripheral blood,
bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph
nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For
example, dendritic cells may be differentiated ex vivo by adding a combination of
cytokines such as GM-CSF, IL-4, IL-13 and/or TNFα to cultures of monocytes
harvested from peripheral blood. Alternatively, CD34 positive cells harvested from
peripheral blood, umbilical cord blood or bone marrow may be differentiated into
dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNFα,
CD40 ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation,
maturation and proliferation of dendritic cells.

Dendritic cells are conveniently categorized as "immature" and "mature"
cells, which allows a simple way to discriminate between two well characterized
phenotypes. However, this nomenclature should not be construed to exclude all
possible intermediate stages of differentiation. Immature dendritic cells are
characterized as APC with a high capacity for antigen uptake and processing, which
correlates with the high expression of Fcγ receptor and mannose receptor. The mature
phenotype is typically characterized by a lower expression of these markers, but a high
expression of cell surface molecules responsible for T cell activation such as class I and
class II MHC, adhesion molecules (e.g., CD54 and CD11) and costimulatory molecules
(e.g., CD40, CD80, CD86 and 4-1BB).

APCs may generally be transfected with a polynucleotide of the
invention (or portion or other variant thereof) such that the encoded polypeptide, or an
immunogenic portion thereof, is expressed on the cell surface. Such transfection may
take place ex vivo, and a pharmaceutical composition comprising such transfected cells
may then be used for therapeutic purposes, as described herein. Alternatively, a gene
delivery vehicle that targets a dendritic or other antigen presenting cell may be
administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex
vivo transfection of dendritic cells, for example, may generally be performed using any
methods known in the art, such as those described in WO 97/24447, or the gene gun
approach described by Mahvi et al., Immunology and Cell Biology 75:456-460, 1997.
Antigen loading of dendritic cells may be achieved by incubating dendritic cells or
progenitor cells with the tumor polypeptide, DNA (naked or within a plasmid vector) or
RNA; or with antigen-expressing recombinant bacterium or viruses (e.g., vaccinia,
fowlpox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be
covalently conjugated to an immunological partner that provides T cell help (e.g., a
carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated
immunological partner, separately or in the presence of the polypeptide.

While any suitable carrier known to those of ordinary skill in the art may
be employed in the pharmaceutical compositions of this invention, the type of carrier
will typically vary depending on the mode of administration. Compositions of the
present invention may be formulated for any appropriate manner of administration,
including for example, topical, oral, nasal, mucosal, intravenous, intracranial,
intraperitoneal, subcutaneous and intramuscular administration.

Carriers for use within such pharmaceutical compositions are
biocompatible, and may also be biodegradable. In certain embodiments, the
formulation preferably provides a relatively constant level of active component release.
In other embodiments, however, a more rapid rate of release immediately upon
administration may be desired. The formulation of such compositions is well within the
level of ordinary skill in the art using known techniques. Illustrative carriers useful in
this regard include microparticles of poly(lactide-co-glycolide), polyacrylate, latex,
starch, cellulose, dextran and the like. Other illustrative delayed-release carriers
include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g.,
a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer
comprising an amphiphilic compound, such as a phospholipid (see, e.g., U.S. Patent No.
5,151,254 and PCT applications WO 94/20078, WO 94/23701 and WO 96/06638). The
amount of active compound contained within a sustained release formulation depends
upon the site of implantation, the rate and expected duration of release and the nature of
the condition to be treated or prevented.

In another illustrative embodiment, biodegradable microspheres (e.g.,
polylactate polyglycolate) are employed as carriers for the compositions of this
invention. Suitable biodegradable microspheres are disclosed, for example, in U.S.
Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763;
5,814,344; 5,407,609 and 5,942,252. Modified hepatitis B core protein carrier systems,
such as described in WO 99/40934, and references cited therein, will also be useful for
many applications. Another illustrative carrier/delivery system employs a carrier
comprising particulate-protein complexes, such as those described in U.S. Patent No.
5,928,647, which are capable of inducing class I-restricted cytotoxic T lymphocyte
responses in a host.

The pharmaceutical compositions of the invention will often further
comprise one or more buffers (e.g., neutral buffered saline or phosphate buffered
saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins,
polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating
agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that
render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a
recipient, suspending agents, thickening agents and/or preservatives. Alternatively,
compositions of the present invention may be formulated as a lyophilizate.

The pharmaceutical compositions described herein may be presented in
unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers
are typically sealed in such a way to preserve the sterility and stability of the
formulation until use. In general, formulations may be stored as suspensions, solutions
or emulsions in oily or aqueous vehicles. Alternatively, a pharmaceutical composition
may be stored in a freeze-dried condition requiring only the addition of a sterile liquid
carrier immediately prior to use.

The development of suitable dosing and treatment regimens for using the
particular compositions described herein in a variety of treatment regimens, including
e.g., oral, parenteral, intravenous, intranasal, and intramuscular administration and
formulation, is well known in the art, some of which are briefly discussed below for
general purposes of illustration.

In certain applications, the pharmaceutical compositions disclosed herein
may be delivered via oral administration to an animal. As such, these compositions
may be formulated with an inert diluent or with an assimilable edible carrier, or they
may be enclosed in hard- or soft-shell gelatin capsules, or they may be compressed into
tablets, or they may be incorporated directly with the food of the diet.

The active compounds may even be incorporated with excipients and
used in the form of ingestible tablets, buccal tablets, troches, capsules, elixirs,
suspensions, syrups, wafers, and the like (see, for example, Mathiowitz et al., Nature
1997 Mar 27;386(6623):410-4; Hwang et al., Crit Rev Ther Drug Carrier Syst
1998;15(3):243-84; U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent
5,792,451). Tablets, troches, pills, capsules and the like may also contain any of a
variety of additional components, for example, a binder, such as gum tragacanth, acacia,
cornstarch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating agent,
such as corn starch, potato starch, alginic acid and the like; a lubricant, such as
magnesium stearate; a sweetening agent, such as sucrose, lactose or saccharin; or a
flavoring agent, such as peppermint, oil of wintergreen, or cherry
flavoring. When the dosage unit form is a capsule, it may contain, in addition to
materials of the above type, a liquid carrier. Various other materials may be present as
coatings or to otherwise modify the physical form of the dosage unit. For instance,
tablets, pills, or capsules may be coated with shellac, sugar, or both. Of course, any
material used in preparing any dosage unit form should be pharmaceutically pure and
substantially non-toxic in the amounts employed. In addition, the active compounds
may be incorporated into sustained-release preparations and formulations.

Typically, these formulations will contain at least about 0.1% of the
active compound or more, although the percentage of the active ingredient(s) may, of
course, be varied and may conveniently be between about 1 or 2% and about 60% or
70% or more of the weight or volume of the total formulation. Naturally, the amount of
active compound(s) in each therapeutically useful composition may be prepared in such
a way that a suitable dosage will be obtained in any given unit dose of the compound.
Factors such as solubility, bioavailability, biological half-life, route of administration,
product shelf life, as well as other pharmacological considerations will be contemplated
by one skilled in the art of preparing such pharmaceutical formulations, and as such, a
variety of dosages and treatment regimens may be desirable.
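
As a rough illustration of the percentages discussed above, the following Python sketch computes the active compound as a percentage of total formulation weight and compares it with the approximate figures quoted in the text (a floor of about 0.1% and a convenient range of roughly 1% to 70%). The function names, and the choice of 1% and 70% as the bounds of the convenient range, are assumptions made for illustration only and are not part of the disclosure.

```python
def active_percent(active_mg: float, total_mg: float) -> float:
    """Return the active compound as a percentage of total formulation weight."""
    if total_mg <= 0:
        raise ValueError("total_mg must be positive")
    if active_mg < 0 or active_mg > total_mg:
        raise ValueError("active_mg must lie between 0 and total_mg")
    return 100.0 * active_mg / total_mg


def describe_percent(percent: float, floor: float = 0.1,
                     low: float = 1.0, high: float = 70.0) -> str:
    """Classify a percentage against the approximate figures quoted in the text.

    The default bounds are illustrative assumptions, not fixed limits.
    """
    if percent < floor:
        return "below floor"
    if low <= percent <= high:
        return "within convenient range"
    return "above floor, outside convenient range"


if __name__ == "__main__":
    # Hypothetical formulation: 5 mg active compound in a 500 mg tablet.
    pct = active_percent(active_mg=5.0, total_mg=500.0)
    print(pct, describe_percent(pct))
```

The bounds are passed as parameters because the text states them only approximately ("about 1 or 2%", "about 60% or 70% or more").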

For oral administration the compositions of the present invention may
alternatively be incorporated with one or more excipients in the form of a mouthwash,
dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation.
Alternatively, the active ingredient may be incorporated into an oral solution such as
one containing sodium borate, glycerin and potassium bicarbonate, or dispersed in a
dentifrice, or added in a therapeutically-effective amount to a composition that may
include water, binders, abrasives, flavoring agents, foaming agents, and humectants.
Alternatively the compositions may be fashioned into a tablet or solution form that may
be placed under the tongue or otherwise dissolved in the mouth.

In certain circumstances it will be desirable to deliver the pharmaceutical
compositions disclosed herein parenterally, intravenously, intramuscularly, or even
intraperitoneally. Such approaches are well known to the skilled artisan, some of which
are further described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515
and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds
as free base or pharmacologically acceptable salts may be prepared in water suitably
mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils.
Under ordinary conditions of storage and use, these preparations generally will contain a
preservative to prevent the growth of microorganisms.

Illustrative pharmaceutical forms suitable for injectable use include
sterile aqueous solutions or dispersions and sterile powders for the extemporaneous
preparation of sterile injectable solutions or dispersions (for example, see U. S. Patent
5,466,468). In all cases the form must be sterile and must be fluid to the extent that
easy syringability exists. It must be stable under the conditions of manufacture and
storage and must be preserved against the contaminating action of microorganisms,
such as bacteria and fungi. The carrier can be a solvent or dispersion medium
containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and
liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable
oils. Proper fluidity may be maintained, for example, by the use of a coating, such as
lecithin, by the maintenance of the required particle size in the case of dispersion and/or
by the use of surfactants. The prevention of the action of microorganisms can be
facilitated by various antibacterial and antifungal agents, for example, parabens,
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be
preferable to include isotonic agents, for example, sugars or sodium chloride.
Prolonged absorption of the injectable compositions can be brought about by the use in
the compositions of agents delaying absorption, for example, aluminum monostearate
and gelatin.

In one embodiment, for parenteral administration in an aqueous solution,
the solution should be suitably buffered if necessary and the liquid diluent first rendered
isotonic with sufficient saline or glucose. These particular aqueous solutions are
especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal
administration. In this connection, a sterile aqueous medium that can be employed will
be known to those of skill in the art in light of the present disclosure. For example, one
dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml
of hypodermoclysis fluid or injected at the proposed site of infusion (see, for example,
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and
1570-1580). Some variation in dosage will necessarily occur depending on the condition of
the subject being treated. Moreover, for human administration, preparations will of
course preferably meet sterility, pyrogenicity, and the general safety and purity
standards as required by FDA Office of Biologics standards.

In another embodiment of the invention, the compositions disclosed
herein may be formulated in a neutral or salt form. Illustrative
pharmaceutically-acceptable salts include the acid addition salts (formed with the free
amino groups of the protein) and which are formed with inorganic acids such as, for
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic,
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be
derived from inorganic bases such as, for example, sodium, potassium, ammonium,
calcium, or ferric hydroxides, and such organic bases as isopropylamine,
trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be
administered in a manner compatible with the dosage formulation and in such amount
as is therapeutically effective.

The carriers can further comprise any and all solvents, dispersion media,
vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption
delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use
of such media and agents for pharmaceutical active substances is well known in the art.
Except insofar as any conventional media or agent is incompatible with the active
ingredient, its use in the therapeutic compositions is contemplated. Supplementary
active ingredients can also be incorporated into the compositions. The phrase
"pharmaceutically-acceptable" refers to molecular entities and compositions that do not
produce an allergic or similar untoward reaction when administered to a human.

In certain embodiments, the pharmaceutical compositions may be
delivered by intranasal sprays, inhalation, and/or other aerosol delivery vehicles.
Methods for delivering genes, nucleic acids, and peptide compositions directly to the
lungs via nasal aerosol sprays have been described, e.g., in U. S. Patent 5,756,353 and
U. S. Patent 5,804,212. Likewise, the delivery of drugs using intranasal microparticle
resins (Takenaga et al., J Controlled Release 1998 Mar 2;52(1-2):81-7) and
lysophosphatidyl-glycerol compounds (U. S. Patent 5,725,871) is also well-known in
the pharmaceutical arts. Likewise, illustrative transmucosal drug delivery in the form of
a polytetrafluoroethylene support matrix is described in U. S. Patent 5,780,045.

In certain embodiments, liposomes, nanocapsules, microparticles, lipid
particles, vesicles, and the like, are used for the introduction of the compositions of the
present invention into suitable host cells/organisms. In particular, the compositions of
the present invention may be formulated for delivery either encapsulated in a lipid
particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like. Alternatively,
compositions of the present invention can be bound, either covalently or non-covalently,
to the surface of such carrier vehicles.

The formation and use of liposome and liposome-like preparations as
potential drug carriers is generally known to those of skill in the art (see for example,
Lasic, Trends Biotechnol 1998 Jul;16(7):307-21; Takakura, Nippon Rinsho 1998
Mar;56(3):691-5; Chandran et al., Indian J Exp Biol. 1997 Aug;35(8):801-9; Margalit,
Crit Rev Ther Drug Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S.
Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent
5,795,587, each specifically incorporated herein by reference in its entirety).

Liposomes have been used successfully with a number of cell types that
are normally difficult to transfect by other procedures, including T cell suspensions,
primary hepatocyte cultures and PC12 cells (Renneisen et al., J Biol Chem. 1990 Sep
25;265(27):16337-42; Muller et al., DNA Cell Biol. 1990 Apr;9(3):221-9). In addition,
liposomes are free of the DNA length constraints that are typical of viral-based delivery
systems. Liposomes have been used effectively to introduce genes, various drugs,
radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and
the like, into a variety of cultured cell lines and animals. Furthermore, the use of
liposomes does not appear to be associated with autoimmune responses or unacceptable
toxicity after systemic delivery.

In certain embodiments, liposomes are formed from phospholipids that
are dispersed in an aqueous medium and spontaneously form multilamellar concentric
bilayer vesicles (also termed multilamellar vesicles (MLVs)).

Alternatively, in other embodiments, the invention provides for
pharmaceutically-acceptable nanocapsule formulations of the compositions of the
present invention. Nanocapsules can generally entrap compounds in a stable and
reproducible way (see, for example, Quintanar-Guerrero et al., Drug Dev Ind Pharm.
1998 Dec;24(12):1113-28). To avoid side effects due to intracellular polymeric
overloading, such ultrafine particles (sized around 0.1 µm) may be designed using
polymers able to be degraded in vivo. Such particles can be made as described, for
example, by Couvreur et al., Crit Rev Ther Drug Carrier Syst. 1988;5(1):1-20; zur
Muhlen et al., Eur J Pharm Biopharm. 1998 Mar;45(2):149-55; Zambaux et al., J
Controlled Release. 1998 Jan 2;50(1-3):31-40; and U. S. Patent 5,145,684.

Cancer Therapeutic Methods 

In further aspects of the present invention, the pharmaceutical
compositions described herein may be used for the treatment of cancer, particularly for
the immunotherapy of breast cancer. Within such methods, the pharmaceutical
compositions described herein are administered to a patient, typically a warm-blooded
animal, preferably a human. A patient may or may not be afflicted with cancer.
Accordingly, the above pharmaceutical compositions may be used to prevent the
development of a cancer or to treat a patient afflicted with a cancer. Pharmaceutical
compositions and vaccines may be administered either prior to or following surgical
removal of primary tumors and/or treatment such as administration of radiotherapy or
conventional chemotherapeutic drugs. As discussed above, administration of the
pharmaceutical compositions may be by any suitable method, including administration
by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal,
anal, vaginal, topical and oral routes.

Within certain embodiments, immunotherapy may be active
immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous
host immune system to react against tumors with the administration of immune
response-modifying agents (such as polypeptides and polynucleotides as provided
herein).

Within other embodiments, immunotherapy may be passive
immunotherapy, in which treatment involves the delivery of agents with established
tumor-immune reactivity (such as effector cells or antibodies) that can directly or
indirectly mediate antitumor effects and do not necessarily depend on an intact host
immune system. Examples of effector cells include T cells as discussed above, T
lymphocytes (such as CD8+ cytotoxic T lymphocytes and CD4+ T-helper
tumor-infiltrating lymphocytes), killer cells (such as Natural Killer cells and
lymphokine-activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and
macrophages) expressing a polypeptide provided herein. T cell receptors and antibody
receptors specific for the polypeptides recited herein may be cloned, expressed and
transferred into other vectors or effector cells for adoptive immunotherapy. The
polypeptides provided herein may also be used to generate antibodies or anti-idiotypic
antibodies (as described above and in U.S. Patent No. 4,918,164) for passive
immunotherapy.

Effector cells may generally be obtained in sufficient quantities for
adoptive immunotherapy by growth in vitro, as described herein. Culture conditions for
expanding single antigen-specific effector cells to several billion in number with
retention of antigen recognition in vivo are well known in the art. Such in vitro culture
conditions typically use intermittent stimulation with antigen, often in the presence of
cytokines (such as IL-2) and non-dividing feeder cells. As noted above,
immunoreactive polypeptides as provided herein may be used to rapidly expand
antigen-specific T cell cultures in order to generate a sufficient number of cells for
immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage,
monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides
or transfected with one or more polynucleotides using standard techniques well known
in the art. For example, antigen-presenting cells can be transfected with a
polynucleotide having a promoter appropriate for increasing expression in a
recombinant virus or other expression system. Cultured effector cells for use in therapy
must be able to grow and distribute widely, and to survive long term in vivo. Studies
have shown that cultured effector cells can be induced to grow in vivo and to survive
long term in substantial numbers by repeated stimulation with antigen supplemented
with IL-2 (see, for example, Cheever et al., Immunological Reviews 157:117, 1997).

Alternatively, a vector expressing a polypeptide recited herein may be
introduced into antigen presenting cells taken from a patient and clonally propagated ex
vivo for transplant back into the same patient. Transfected cells may be reintroduced
into the patient using any means known in the art, preferably in sterile form by
intravenous, intracavitary, intraperitoneal or intratumor administration.

Routes and frequency of administration of the therapeutic compositions
described herein, as well as dosage, will vary from individual to individual, and may be
readily established using standard techniques. In general, the pharmaceutical
compositions and vaccines may be administered by injection (e.g., intracutaneous,
intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally.
Preferably, between 1 and 10 doses may be administered over a 52 week period.
Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations
may be given periodically thereafter. Alternate protocols may be appropriate for
individual patients. A suitable dose is an amount of a compound that, when
administered as described above, is capable of promoting an anti-tumor immune
response, and is at least 10-50% above the basal (i.e., untreated) level. Such response
can be monitored by measuring the anti-tumor antibodies in a patient or by
vaccine-dependent generation of cytolytic effector cells capable of killing the patient's tumor
cells in vitro. Such vaccines should also be capable of causing an immune response that
leads to an improved clinical outcome (e.g., more frequent remissions, complete or
partial, or longer disease-free survival) in vaccinated patients as compared to
non-vaccinated patients. In general, for pharmaceutical compositions and vaccines
comprising one or more polypeptides, the amount of each polypeptide present in a dose
ranges from about 25 µg to 5 mg per kg of host. Suitable dose sizes will vary with the
size of the patient, but will typically range from about 0.1 mL to about 5 mL.
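
The per-kilogram range and the "above basal" criterion stated above lend themselves to a short worked calculation. The Python sketch below is illustrative only: the function names are hypothetical, the 70 kg host weight is an arbitrary example, and the response values are placeholders rather than measured data.

```python
from typing import Tuple


def polypeptide_dose_range_mg(host_kg: float,
                              low_ug_per_kg: float = 25.0,
                              high_mg_per_kg: float = 5.0) -> Tuple[float, float]:
    """Return (low, high) polypeptide amounts in mg for a host of the given weight.

    Defaults follow the range quoted in the text: about 25 ug to 5 mg per kg.
    """
    if host_kg <= 0:
        raise ValueError("host_kg must be positive")
    low_mg = host_kg * low_ug_per_kg / 1000.0  # convert micrograms to milligrams
    high_mg = host_kg * high_mg_per_kg
    return low_mg, high_mg


def percent_above_basal(treated: float, basal: float) -> float:
    """Return how far a treated response lies above the basal level, in percent."""
    if basal <= 0:
        raise ValueError("basal must be positive")
    return 100.0 * (treated - basal) / basal


if __name__ == "__main__":
    low, high = polypeptide_dose_range_mg(70.0)  # hypothetical 70 kg host
    print(f"dose range: {low} mg to {high} mg")
    # Placeholder readings in arbitrary units; the text asks for at least 10-50%.
    print(f"response above basal: {percent_above_basal(120.0, 100.0)}%")
```

The response units are left unspecified because the text allows monitoring either by antibody measurement or by cytolytic effector cell assays.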

In general, an appropriate dosage and treatment regimen provides the
active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic
benefit. Such a response can be monitored by establishing an improved clinical
outcome (e.g., more frequent remissions, complete or partial, or longer disease-free
survival) in treated patients as compared to non-treated patients. Increases in
preexisting immune responses to a tumor protein generally correlate with an improved
clinical outcome. Such immune responses may generally be evaluated using standard
proliferation, cytotoxicity or cytokine assays, which may be performed using samples
obtained from a patient before and after treatment.

Cancer Detection and Diagnostic Compositions, Methods and Kits

In general, a cancer may be detected in a patient based on the presence of
one or more breast tumor proteins and/or polynucleotides encoding such proteins in a
biological sample (for example, blood, sera, sputum, urine and/or tumor biopsies)
obtained from the patient. In other words, such proteins may be used as markers to
indicate the presence or absence of a cancer such as breast cancer. In addition, such
proteins may be useful for the detection of other cancers. The binding agents provided
herein generally permit detection of the level of antigen that binds to the agent in the
biological sample. Polynucleotide primers and probes may be used to detect the level of
mRNA encoding a tumor protein, which is also indicative of the presence or absence of
a cancer. In general, a breast tumor sequence should be present at a level that is at least
three fold higher in tumor tissue than in normal tissue.
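
The three-fold criterion in the preceding paragraph can be expressed as a simple ratio test. The Python sketch below is a minimal illustration under stated assumptions: the function name is hypothetical, expression levels are taken to be positive values in the same arbitrary units for both tissues, and the example numbers are placeholders rather than measured data.

```python
def is_tumor_elevated(tumor_level: float, normal_level: float,
                      min_fold: float = 3.0) -> bool:
    """Return True if the tumor level is at least min_fold times the normal level."""
    if normal_level <= 0:
        raise ValueError("normal_level must be positive to form a ratio")
    if tumor_level < 0:
        raise ValueError("tumor_level must not be negative")
    return tumor_level / normal_level >= min_fold


if __name__ == "__main__":
    # Placeholder expression levels in arbitrary units.
    print(is_tumor_elevated(tumor_level=9.0, normal_level=3.0))
    print(is_tumor_elevated(tumor_level=5.0, normal_level=3.0))
```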

There are a variety of assay formats known to those of ordinary skill in
the art for using a binding agent to detect polypeptide markers in a sample. See, e.g.,
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory,
1988. In general, the presence or absence of a cancer in a patient may be determined by
(a) contacting a biological sample obtained from a patient with a binding agent; (b)
detecting in the sample a level of polypeptide that binds to the binding agent; and (c)
comparing the level of polypeptide with a predetermined cut-off value.
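
Step (c) above, the comparison against a predetermined cut-off value, can be sketched as follows. This Python example is illustrative only: the function name, the sample identifiers and the numeric levels are hypothetical, the cut-off is simply supplied by the caller, and treating a level strictly above the cut-off as positive is an assumption made for the sketch rather than a rule stated in this passage.

```python
from typing import Dict


def compare_with_cutoff(levels: Dict[str, float], cutoff: float) -> Dict[str, str]:
    """Label each sample 'positive' or 'negative' relative to a supplied cut-off.

    Assumption: a detected level strictly above the cut-off counts as positive.
    """
    return {
        sample_id: ("positive" if level > cutoff else "negative")
        for sample_id, level in levels.items()
    }


if __name__ == "__main__":
    # Placeholder signal readings in arbitrary units.
    readings = {"sample_A": 0.82, "sample_B": 0.21}
    print(compare_with_cutoff(readings, cutoff=0.50))
```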

In a preferred embodiment, the assay involves the use of binding agent
immobilized on a solid support to bind to and remove the polypeptide from the
remainder of the sample. The bound polypeptide may then be detected using a detection
reagent that contains a reporter group and specifically binds to the binding
agent/polypeptide complex. Such detection reagents may comprise, for example, a
binding agent that specifically binds to the polypeptide or an antibody or other agent
that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G,
protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a
polypeptide is labeled with a reporter group and allowed to bind to the immobilized
binding agent after incubation of the binding agent with the sample. The extent to
which components of the sample inhibit the binding of the labeled polypeptide to the
binding agent is indicative of the reactivity of the sample with the immobilized binding
agent. Suitable polypeptides for use within such assays include full length breast tumor
proteins and polypeptide portions thereof to which the binding agent binds, as described
above.

The solid support may be any material known to those of ordinary skill
in the art to which the tumor protein may be attached. For example, the solid support
may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane.
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a
plastic material such as polystyrene or polyvinylchloride. The support may also be a
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S.
Patent No. 5,359,681. The binding agent may be immobilized on the solid support
using a variety of techniques known to those of skill in the art, which are amply
described in the patent and scientific literature. In the context of the present invention,
the term "immobilization" refers to both noncovalent association, such as adsorption,
and covalent attachment (which may be a direct linkage between the agent and
functional groups on the support or may be a linkage by way of a cross-linking agent).
Immobilization by adsorption to a well in a microtiter plate or to a membrane is
preferred. In such cases, adsorption may be achieved by contacting the binding agent, in
a suitable buffer, with the solid support for a suitable amount of time. The contact time
varies with temperature, but is typically between about 1 hour and about 1 day. In
general, contacting a well of a plastic microtiter plate (such as polystyrene or
polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about
10 µg, and preferably about 100 ng to about 1 µg, is sufficient to immobilize an
adequate amount of binding agent.

Covalent attachment of binding agent to a solid support may generally be
achieved by first reacting the support with a bifunctional reagent that will react with
both the support and a functional group, such as a hydroxyl or amino group, on the
binding agent. For example, the binding agent may be covalently attached to supports
having an appropriate polymer coating using benzoquinone or by condensation of an
aldehyde group on the support with an amine and an active hydrogen on the binding
partner (see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at
A12-A13).

In certain embodiments, the assay is a two-antibody sandwich assay.
This assay may be performed by first contacting an antibody that has been immobilized
on a solid support, commonly the well of a microtiter plate, with the sample, such that
polypeptides within the sample are allowed to bind to the immobilized antibody.
Unbound sample is then removed from the immobilized polypeptide-antibody
complexes and a detection reagent (preferably a second antibody capable of binding to a
different site on the polypeptide) containing a reporter group is added. The amount of
detection reagent that remains bound to the solid support is then determined using a
method appropriate for the specific reporter group.

More specifically, once the antibody is immobilized on the support as
described above, the remaining protein binding sites on the support are typically
blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as
bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO), may be used. The
immobilized antibody is then incubated with the sample, and polypeptide is allowed to
bind to the antibody. The sample may be diluted with a suitable diluent, such as
phosphate-buffered saline (PBS), prior to incubation. In general, an appropriate contact
time (i.e., incubation time) is a period of time that is sufficient to detect the presence of
polypeptide within a sample obtained from an individual with breast cancer. Preferably,
the contact time is sufficient to achieve a level of binding that is at least about 95% of
that achieved at equilibrium between bound and unbound polypeptide. Those of
ordinary skill in the art will recognize that the time necessary to achieve equilibrium
may be readily determined by assaying the level of binding that occurs over a period of
time. At room temperature, an incubation time of about 30 minutes is generally
sufficient.
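
By way of illustration only, the determination of a contact time from a binding time course may be sketched as follows. The function name, the sample readings and the use of the final measurement as an estimate of the equilibrium level are assumptions made for this sketch and are not taken from the specification.

```python
def time_to_fraction_of_equilibrium(times, signals, fraction=0.95):
    """Return the earliest sampled time at which the bound signal reaches
    the given fraction of the plateau level.

    Assumption: the last measurement approximates binding at equilibrium.
    """
    if not times or len(times) != len(signals):
        raise ValueError("times and signals must be non-empty and of equal length")
    plateau = signals[-1]
    threshold = fraction * plateau
    for t, s in zip(times, signals):
        if s >= threshold:
            return t
    return None


# Hypothetical time course: minutes of incubation vs. normalized bound signal.
minutes = [0, 5, 10, 20, 30, 60, 120]
bound = [0.0, 0.40, 0.65, 0.88, 0.96, 0.99, 1.00]
contact_time = time_to_fraction_of_equilibrium(minutes, bound)
```

With these hypothetical readings the first time point at or above 95% of the plateau is the 30 minute sample. A real determination would use replicate measurements and confirm that the signal has actually plateaued.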

Unbound sample may then be removed by washing the solid support
with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second
antibody, which contains a reporter group, may then be added to the solid support.
Preferred reporter groups include those groups recited above.

The detection reagent is then incubated with the immobilized antibody-polypeptide
complex for an amount of time sufficient to detect the bound polypeptide.
An appropriate amount of time may generally be determined by assaying the level of
binding that occurs over a period of time. Unbound detection reagent is then removed
and bound detection reagent is detected using the reporter group. The method employed 
for detecting the reporter group depends upon the nature of the reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally 
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups
and fluorescent groups. Biotin may be detected using avidin, coupled to a different
reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
specific period of time), followed by spectroscopic or other analysis of the reaction 
products. 

To determine the presence or absence of a cancer, such as breast cancer,
the signal detected from the reporter group that remains bound to the solid support is
generally compared to a signal that corresponds to a predetermined cut-off value. In
one preferred embodiment, the cut-off value for the detection of a cancer is the average
mean signal obtained when the immobilized antibody is incubated with samples from
patients without the cancer. In general, a sample generating a signal that is three
standard deviations above the predetermined cut-off value is considered positive for the
cancer. In an alternate preferred embodiment, the cut-off value is determined using a
Receiver Operator Curve, according to the method of Sackett et al., Clinical
Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985,
p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot
of pairs of true positive rates (i.e., sensitivity) and false positive rates (100%-specificity)
that correspond to each possible cut-off value for the diagnostic test result. The cut-off
value on the plot that is the closest to the upper left-hand corner (i.e., the value that
encloses the largest area) is the most accurate cut-off value, and a sample generating a
signal that is higher than the cut-off value determined by this method may be considered
positive. Alternatively, the cut-off value may be shifted to the left along the plot, to
minimize the false positive rate, or to the right, to minimize the false negative rate. In
general, a sample generating a signal that is higher than the cut-off value determined by
this method is considered positive for a cancer.
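
The two cut-off embodiments described above may be sketched, purely as an illustration, in the following way. The function names and the Euclidean distance used to locate the point closest to the upper left-hand corner are assumptions of this sketch; the specification does not prescribe a particular computation.

```python
from statistics import mean, stdev


def three_sd_threshold(negative_signals):
    """Mean signal of samples from patients without the cancer (the cut-off),
    plus three sample standard deviations (the level called positive)."""
    return mean(negative_signals) + 3 * stdev(negative_signals)


def roc_cutoff(negative_signals, positive_signals):
    """Return (cutoff, true_positive_rate, false_positive_rate) for the
    candidate cut-off whose ROC point lies closest to the corner (FPR=0, TPR=1).

    A sample is called positive when its signal is higher than the cut-off.
    """
    best = None
    for c in sorted(set(negative_signals) | set(positive_signals)):
        tpr = sum(s > c for s in positive_signals) / len(positive_signals)
        fpr = sum(s > c for s in negative_signals) / len(negative_signals)
        distance = (fpr ** 2 + (1.0 - tpr) ** 2) ** 0.5
        if best is None or distance < best[0]:
            best = (distance, c, tpr, fpr)
    return best[1], best[2], best[3]


# Hypothetical absorbance readings, for illustration only.
negatives = [0.10, 0.12, 0.11, 0.09, 0.13, 0.30]
positives = [0.25, 0.40, 0.55, 0.60, 0.35]
cutoff, tpr, fpr = roc_cutoff(negatives, positives)
```

Shifting the selected cut-off to trade false positives against false negatives, as the text permits, would amount to choosing a different candidate from the same list of (cut-off, rate) pairs.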

In a related embodiment, the assay is performed in a flow-through or
strip test format, wherein the binding agent is immobilized on a membrane, such as
nitrocellulose. In the flow-through test, polypeptides within the sample bind to the
immobilized binding agent as the sample passes through the membrane. A second,
labeled binding agent then binds to the binding agent-polypeptide complex as a solution
containing the second binding agent flows through the membrane. The detection of
bound second binding agent may then be performed as described above. In the strip test
format, one end of the membrane to which binding agent is bound is immersed in a
solution containing the sample. The sample migrates along the membrane through a
region containing second binding agent and to the area of immobilized binding agent.
Concentration of second binding agent at the area of immobilized antibody indicates the
presence of a cancer. Typically, the concentration of second binding agent at that site
generates a pattern, such as a line, that can be read visually. The absence of such a
pattern indicates a negative result. In general, the amount of binding agent immobilized
on the membrane is selected to generate a visually discernible pattern when the
biological sample contains a level of polypeptide that would be sufficient to generate a
positive signal in the two-antibody sandwich assay, in the format discussed above.
Preferred binding agents for use in such assays are antibodies and antigen-binding
fragments thereof. Preferably, the amount of antibody immobilized on the membrane
ranges from about 25 ng to about 1 µg, and more preferably from about 50 ng to about
500 ng. Such tests can typically be performed with a very small amount of biological
sample.

Of course, numerous other assay protocols exist that are suitable for use
with the tumor proteins or binding agents of the present invention. The above
descriptions are intended to be exemplary only. For example, it will be apparent to
those of ordinary skill in the art that the above protocols may be readily modified to use
tumor polypeptides to detect antibodies that bind to such polypeptides in a biological
sample. The detection of such tumor protein specific antibodies may correlate with the
presence of a cancer.

A cancer may also, or alternatively, be detected based on the presence of
T cells that specifically react with a tumor protein in a biological sample. Within
certain methods, a biological sample comprising CD4+ and/or CD8+ T cells isolated
from a patient is incubated with a tumor polypeptide, a polynucleotide encoding such a
polypeptide and/or an APC that expresses at least an immunogenic portion of such a
polypeptide, and the presence or absence of specific activation of the T cells is detected.
Suitable biological samples include, but are not limited to, isolated T cells. For
example, T cells may be isolated from a patient by routine techniques (such as by
Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T
cells may be incubated in vitro for 2-9 days (typically 4 days) at 37°C with polypeptide
(e.g., 5-25 µg/ml). It may be desirable to incubate another aliquot of a T cell sample in
the absence of tumor polypeptide to serve as a control. For CD4+ T cells, activation is
preferably detected by evaluating proliferation of the T cells. For CD8+ T cells,
activation is preferably detected by evaluating cytolytic activity. A level of proliferation
that is at least two fold greater and/or a level of cytolytic activity that is at least 20%
greater than in disease-free patients indicates the presence of a cancer in the patient.
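
The comparison against disease-free reference values may be sketched as below. This is an illustrative reading only: the function and parameter names are hypothetical, and "at least 20% greater" is interpreted here as a relative increase (at least 1.2 times the reference), which is an assumption; the text could also be read as a difference of 20 percentage points of lysis.

```python
def t_cell_response_positive(proliferation=None, proliferation_ref=None,
                             cytolysis=None, cytolysis_ref=None):
    """Return True if either criterion is met (the text says "and/or").

    proliferation / proliferation_ref: e.g. counts from a proliferation assay
    for the patient sample and for disease-free patients.
    cytolysis / cytolysis_ref: e.g. percent specific lysis for the patient
    sample and for disease-free patients.
    """
    proliferation_hit = (
        proliferation is not None
        and proliferation_ref is not None
        and proliferation >= 2.0 * proliferation_ref  # at least two fold greater
    )
    cytolysis_hit = (
        cytolysis is not None
        and cytolysis_ref is not None
        and cytolysis >= 1.2 * cytolysis_ref  # assumed relative 20% increase
    )
    return proliferation_hit or cytolysis_hit
```

Either measurement may be omitted, reflecting that proliferation is the preferred readout for CD4+ T cells and cytolytic activity for CD8+ T cells.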

As noted above, a cancer may also, or alternatively, be detected based on
the level of mRNA encoding a tumor protein in a biological sample. For example, at
least two oligonucleotide primers may be employed in a polymerase chain reaction
(PCR) based assay to amplify a portion of a tumor cDNA derived from a biological
sample, wherein at least one of the oligonucleotide primers is specific for (i.e.,
hybridizes to) a polynucleotide encoding the tumor protein. The amplified cDNA is
then separated and detected using techniques well known in the art, such as gel
electrophoresis. Similarly, oligonucleotide probes that specifically hybridize to a
polynucleotide encoding a tumor protein may be used in a hybridization assay to detect
the presence of polynucleotide encoding the tumor protein in a biological sample.

To permit hybridization under assay conditions, oligonucleotide primers
and probes should comprise an oligonucleotide sequence that has at least about 60%,
preferably at least about 75% and more preferably at least about 90%, identity to a
portion of a polynucleotide encoding a tumor protein of the invention that is at least 10
nucleotides, and preferably at least 20 nucleotides, in length. Preferably,
oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a
polypeptide described herein under moderately stringent conditions, as defined above.
Oligonucleotide primers and/or probes which may be usefully employed in the
diagnostic methods described herein preferably are at least 10-40 nucleotides in length.
In a preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule
having a sequence as disclosed herein. Techniques for both PCR based assays and
hybridization assays are well known in the art (see, for example, Mullis et al., Cold
Spring Harbor Symp. Quant. Biol., 51:263, 1987; Erlich ed., PCR Technology, Stockton
Press, NY, 1989).
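
As a rough illustration of the identity and length criteria above, a candidate oligonucleotide may be screened against a target sequence as sketched below. The function names and example sequences are hypothetical. The sketch computes ungapped identity on one strand only (no reverse complement, no gaps) and is offered as a simple proxy; it does not predict hybridization under any particular stringency.

```python
def best_window_identity(oligo, target):
    """Highest ungapped percent identity between the oligo and any
    equal-length window of the target sequence (same strand only)."""
    oligo, target = oligo.upper(), target.upper()
    n = len(oligo)
    if n == 0 or n > len(target):
        raise ValueError("oligo must be non-empty and no longer than the target")
    best = 0
    for i in range(len(target) - n + 1):
        matches = sum(a == b for a, b in zip(oligo, target[i:i + n]))
        best = max(best, matches)
    return 100.0 * best / n


def meets_identity_criteria(oligo, target, min_identity=60.0, min_length=10):
    """Check the minimum length and the minimum percent identity."""
    return (len(oligo) >= min_length
            and best_window_identity(oligo, target) >= min_identity)


# Hypothetical sequences, for illustration only.
target = "ATGGCCTTAGCTAGGCTAACGTTAGC"
exact = best_window_identity("GCTAGGCTAACG", target)
```

The defaults correspond to the broadest thresholds recited (about 60% identity over at least 10 nucleotides); the preferred thresholds of 75% or 90% and 20 nucleotides can be passed as arguments.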

One preferred assay employs RT-PCR, in which PCR is applied in
conjunction with reverse transcription. Typically, RNA is extracted from a biological
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules.
PCR amplification using at least one specific primer generates a cDNA molecule, which
may be separated and visualized using, for example, gel electrophoresis. Amplification
may be performed on biological samples taken from a test patient and from an
individual who is not afflicted with a cancer. The amplification reaction may be
performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold
or greater increase in expression in several dilutions of the test patient sample as
compared to the same dilutions of the non-cancerous sample is typically considered
positive.
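
The dilution-series comparison may be sketched as follows. This is a minimal illustration: the function name, the band-intensity values and the choice of two dilutions as the meaning of "several" are assumptions of the sketch, not values stated in the specification.

```python
def rt_pcr_positive(test_signal, control_signal, min_fold=2.0, min_dilutions=2):
    """Compare test and non-cancerous samples at matching cDNA dilutions.

    test_signal / control_signal: dicts mapping dilution factor to a
    quantified band intensity. Returns (is_positive, dilutions_meeting_fold).
    """
    shared = sorted(set(test_signal) & set(control_signal))
    hits = [
        d for d in shared
        if control_signal[d] > 0 and test_signal[d] / control_signal[d] >= min_fold
    ]
    return len(hits) >= min_dilutions, hits


# Hypothetical intensities at 1x, 10x and 100x dilutions (two orders of magnitude).
test = {1: 900, 10: 400, 100: 60}
control = {1: 800, 10: 150, 100: 20}
positive, dilutions = rt_pcr_positive(test, control)
```

In this hypothetical series the undiluted reactions differ by less than two fold, which is why comparing several dilutions can be more informative than a single reaction.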

In another embodiment, the compositions described herein may be used
as markers for the progression of cancer. In this embodiment, assays as described above
for the diagnosis of a cancer may be performed over time, and the change in the level of
reactive polypeptide(s) or polynucleotide(s) evaluated. For example, the assays may be
performed every 24-72 hours for a period of 6 months to 1 year, and thereafter
performed as needed. In general, a cancer is progressing in those patients in whom the
level of polypeptide or polynucleotide detected increases over time. In contrast, the
cancer is not progressing when the level of reactive polypeptide or polynucleotide either
remains constant or decreases with time.

Certain in vivo diagnostic assays may be performed directly on a tumor.
One such assay involves contacting tumor cells with a binding agent. The bound
binding agent may then be detected directly or indirectly via a reporter group. Such
binding agents may also be used in histological applications. Alternatively,
polynucleotide probes may be used within such applications.

As noted above, to improve sensitivity, multiple tumor protein markers
may be assayed within a given sample. It will be apparent that binding agents specific
for different proteins provided herein may be combined within a single assay. Further,
multiple primers or probes may be used concurrently. The selection of tumor protein
markers may be based on routine experiments to determine combinations that result in
optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided
herein may be combined with assays for other known tumor antigens.

The present invention further provides kits for use within any of the
above diagnostic methods. Such kits typically comprise two or more components
necessary for performing a diagnostic assay. Components may be compounds, reagents,
containers and/or equipment. For example, one container within a kit may contain a
monoclonal antibody or fragment thereof that specifically binds to a tumor protein.
Such antibodies or fragments may be provided attached to a support material, as
described above. One or more additional containers may enclose elements, such as
reagents or buffers, to be used in the assay. Such kits may also, or alternatively, contain
a detection reagent as described above that contains a reporter group suitable for direct
or indirect detection of antibody binding.

Alternatively, a kit may be designed to detect the level of mRNA
encoding a tumor protein in a biological sample. Such kits generally comprise at least
one oligonucleotide probe or primer, as described above, that hybridizes to a
polynucleotide encoding a tumor protein. Such an oligonucleotide may be used, for
example, within a PCR or hybridization assay. Additional components that may be
present within such kits include a second oligonucleotide and/or a diagnostic reagent or
container to facilitate the detection of a polynucleotide encoding a tumor protein.

The following Examples are offered by way of illustration and not by 
way of limitation. 



EXAMPLE 1 
Isolation and Characterization of Breast 
Tumor Polypeptides

This Example describes the isolation of breast tumor polypeptides from a 
breast tumor cDNA library.

A cDNA subtraction library containing cDNA from breast tumor
subtracted with normal breast cDNA was constructed as follows. Total RNA was
extracted from primary tissues using Trizol reagent (Gibco BRL Life Technologies,
Gaithersburg, MD) as described by the manufacturer. The polyA+ RNA was purified
using an oligo(dT) cellulose column according to standard protocols. First strand
cDNA was synthesized using the primer supplied in a Clontech PCR-Select cDNA
Subtraction Kit (Clontech, Palo Alto, CA). The driver DNA consisted of cDNAs from
two normal breast tissues with the tester cDNA being from three primary breast tumors.
Double-stranded cDNA was synthesized for both tester and driver, and digested with a
combination of endonucleases (MluI, MscI, PvuII, SalI and StuI) which recognize
six-base-pair DNA sequences. This modification increased the average cDNA size dramatically
compared with cDNAs generated according to the protocol of Clontech (Palo Alto,
CA). The digested tester cDNAs were ligated to two different adaptors and the
subtraction was performed according to Clontech's protocol. The subtracted cDNAs
were subjected to two rounds of PCR amplification, following the manufacturer's
protocol. The resulting PCR products were subcloned into the TA cloning vector,
pCRII (Invitrogen, San Diego, CA) and transformed into ElectroMax E. coli DH10B
cells (Gibco BRL Life Technologies) by electroporation. DNA was isolated from
independent clones and sequenced using a Perkin Elmer/Applied Biosystems Division
(Foster City, CA) Automated Sequencer Model 373A.

Sixty-three distinct cDNA clones were found in the subtracted breast
tumor-specific cDNA library. The determined one strand (5' or 3') cDNA sequences
for the clones are provided in SEQ ID NO:1-61, 72 and 73, respectively. Comparison of
these cDNA sequences with known sequences in the gene bank using the EMBL and
GenBank databases (Release 97) revealed no significant homologies to the sequences
provided in SEQ ID NO:14, 21, 22, 27, 29, 30, 32, 38, 44, 45, 53, 57, 72 and 73. The
sequences of SEQ ID NO:1, 3, 16, 17, 34, 48, 60 and 61 were found to represent
known human genes. The sequences of SEQ ID NO:2, 4, 23, 39 and 50 were found to
show some similarity to previously identified non-human genes. The remaining clones
(SEQ ID NO:5-13, 15, 18-20, 24-26, 28, 31, 33, 35-37, 40-43, 46, 47, 49, 51, 52, 54-56,
58 and 59) were found to show at least some degree of homology to previously
identified expressed sequence tags (ESTs).

Further studies resulted in the isolation of the full-length cDNA sequence
for the clone of SEQ ID NO:57 (referred to as B718P). By computer analysis, the
full-length sequence was found to contain a putative transmembrane domain at amino acids
137-158. The full-length cDNA sequence of B718P is provided in SEQ ID NO:504,
with the cDNA sequence of the open reading frame including stop codon being
provided in SEQ ID NO:505 and the cDNA sequence of the open reading frame without
stop codon being provided in SEQ ID NO:506. The full-length amino acid sequence of
B718P is provided in SEQ ID NO:507. SEQ ID NO:508 represents amino acids 1-158
of B718P, and SEQ ID NO:509 represents amino acids 159-243 of B718P.

To determine mRNA expression levels of the isolated cDNA clones,
cDNA clones from the breast subtraction described above were randomly picked and
colony PCR amplified. Their mRNA expression levels in breast tumor, normal breast
and various other normal tissues were determined using microarray technology
(Synteni, Palo Alto, CA). Briefly, the PCR amplification products were arrayed onto
slides in an array format, with each product occupying a unique location in the array.
mRNA was extracted from the tissue sample to be tested, reverse transcribed, and
fluorescent-labeled cDNA probes were generated. The microarrays were probed with
the labeled cDNA probes, the slides scanned and fluorescence intensity was measured.
Data was analyzed using Synteni-provided GEMTOOLS Software. Of the seventeen
cDNA clones examined, those of SEQ ID NO:40, 46, 59 and 73 were found to be
over-expressed in breast tumor and expressed at low levels in all normal tissues tested
(breast, PBMC, colon, fetal tissue, salivary gland, bone marrow, lung, pancreas, large
intestine, spinal cord, adrenal gland, kidney, pancreas, liver, stomach, skeletal muscle,
heart, small intestine, skin, brain and human mammary epithelial cells). The clones of
SEQ ID NO:41 and 48 were found to be over-expressed in breast tumor and expressed
at low levels in all other tissues tested, with the exception of bone marrow. The clone
of SEQ ID NO:42 was found to be over-expressed in breast tumor and expressed at low
levels in all other tissues tested except bone marrow and spinal cord. The clone of SEQ
ID NO:43 was found to be over-expressed in breast tumor and expressed at low levels
in all other tissues tested with the exception of spinal cord, heart and small intestine.
The clone of SEQ ID NO:51 was found to be over-expressed in breast tumor and
expressed at low levels in all other tissues tested with the exception of large intestine.
The clone of SEQ ID NO:54 was found to be over-expressed in breast tumor and
expressed at low levels in all other tissues tested with the exception of PBMC, stomach
and small intestine. The clone of SEQ ID NO:56 was found to be over-expressed in
breast tumor and expressed at low levels in all other tissues tested with the exception of
large and small intestine, human mammary epithelial cells and SCID mouse-passaged
breast tumor. The clone of SEQ ID NO:60 was found to be over-expressed in breast
tumor and expressed at low levels in all other tissues tested with the exception of spinal
cord and heart. The clone of SEQ ID NO:61 was found to be over-expressed in breast
tumor and expressed at low levels in all other tissues tested with the exception of small
intestine. The clone of SEQ ID NO:72 was found to be over-expressed in breast tumor
and expressed at low levels in all other tissues tested with the exception of colon and
salivary gland.

The results of a Northern blot analysis of the clone SYN18C6 (SEQ ID
NO:40) are shown in Fig. 1. A predicted protein sequence encoded by SYN18C6 is
provided in SEQ ID NO:62.

Additional cDNA clones that are over-expressed in breast tumor tissue
were isolated from breast cDNA subtraction libraries as follows. Breast subtraction
libraries were prepared, as described above, by PCR-based subtraction employing pools
of breast tumor cDNA as the tester and pools of either normal breast cDNA or cDNA
from other normal tissues as the driver. cDNA clones from breast subtraction were
randomly picked and colony PCR amplified and their mRNA expression levels in breast
tumor, normal breast and various other normal tissues were determined using the
microarray technology described above. Twenty-four distinct cDNA clones were found
to be over-expressed in breast tumor and expressed at low levels in all normal tissues
tested (breast, brain, liver, pancreas, lung, salivary gland, stomach, colon, kidney, bone
marrow, skeletal muscle, PBMC, heart, small intestine, adrenal gland, spinal cord, large
intestine and skin). The determined cDNA sequences for these clones are provided in
SEQ ID NO:63-87. Comparison of the sequences of SEQ ID NO:74-87 with those in
the gene bank as described above, revealed homology to previously identified human
genes. No significant homologies were found to the sequences of SEQ ID NO:63-73.

Three DNA isoforms for the clone B726P (partial sequence provided in
SEQ ID NO:71) were isolated as follows. A radioactive probe was synthesized from
B726P by excising B726P DNA from a pT7Blue vector (Novagen) by a BamHI/XbaI
restriction digest and using the resulting DNA as the template in a single-stranded PCR
in the presence of [α-32P]dCTP. The sequence of the primer employed for this PCR is
provided in SEQ ID NO:177. The resulting radioactive probe was used to probe a
directional cDNA library and a random-primed cDNA library made using RNA isolated
from breast tumors. Eighty-five clones were identified, excised, purified and
sequenced. Of these 85 clones, three were found to each contain a significant open
reading frame. The determined cDNA sequence of the isoform B726P-20 is provided in
SEQ ID NO:175, with the corresponding predicted amino acid sequence being provided
in SEQ ID NO:176. The determined cDNA sequence of the isoform B726P-74 is
provided in SEQ ID NO:178, with the corresponding predicted amino acid sequence
being provided in SEQ ID NO:179. The determined cDNA sequence of the isoform
B726P-79 is provided in SEQ ID NO:180, with the corresponding predicted amino acid
sequence being provided in SEQ ID NO:181.

Efforts to obtain a full-length clone of B726P using standard techniques
led to the isolation of five additional clones that represent additional 5' sequence of
B726P. These clones appear to be alternative splice forms of the same gene. The
determined cDNA sequences of these clones are provided in SEQ ID NO:464-468, with
the predicted amino acid sequences encoded by SEQ ID NO:464-467 being provided in
SEQ ID NO:470-473, respectively. Using standard computer techniques, a 3,681 bp
consensus DNA sequence (SEQ ID NO:463) was created that contains two large open
reading frames. The downstream ORF encodes the amino acid sequence of SEQ ID
NO:176. The predicted amino acid sequence encoded by the upstream ORF is provided
in SEQ ID NO:469. Subsequent studies led to the isolation of an additional splice form
of B726P that has a 184 bp insert relative to the other forms. This 184 bp insert causes a
frameshift that brings the downstream and upstream ORFs together into a single ORF
that is 1002 aa in length. The determined cDNA sequence of this alternative splice
form is disclosed in SEQ ID NO:474, with the corresponding amino acid sequence
being provided in SEQ ID NO:475.
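
The frameshift arithmetic behind the 184 bp insert can be checked with a short sketch. The function name is hypothetical; the only inputs taken from the text are the insert length (184 bp) and the ORF length (1002 aa).

```python
def shifts_reading_frame(indel_length_bp):
    """An insertion or deletion shifts the downstream reading frame
    unless its length is a multiple of three (one codon = 3 nucleotides)."""
    return indel_length_bp % 3 != 0


insert_bp = 184                   # insert length stated in the text
frame_offset = insert_bp % 3      # 184 = 61 * 3 + 1, so the frame moves by one base
orf_aa = 1002                     # length of the merged ORF stated in the text
coding_nt = orf_aa * 3            # nucleotides needed to encode 1002 residues
coding_nt_with_stop = coding_nt + 3
```

Because 184 leaves a remainder of 1 on division by 3, sequence downstream of the insert is read in a different frame than in the other forms, which is consistent with two previously separate ORFs being joined into one.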

Comparison of the cDNA sequence of SEQ ID NO:63 (referred to as
B723P) with the sequences in the GeneSeq™ DNA database showed matches to 5 DNA
sequences (Accession nos. A26456, A37144, A26424, V84525 and T22133), 4 of
which appear to represent the full-length sequence of the gene. Three of these
sequences encode a 243 amino acid open reading frame (ORF), while one of the DNA
sequences (Accession no. A37144) contains an extra C at position 35, resulting in a 278
amino acid ORF. The open reading frame, including stop codon, of the first variant of
B723P (referred to as B723P-short) is provided in SEQ ID NO:510, with the open
reading frame without stop codon being provided in SEQ ID NO:511. The open
reading frame, including stop codon, of the second variant of B723P (referred to as
B723P-long) is provided in SEQ ID NO:512, with the open reading frame without stop
codon being provided in SEQ ID NO:513. The amino acid sequences of B723P-short
and B723P-long are provided in SEQ ID NO:514 and 515, respectively. Computer
analysis of these sequences demonstrated the presence of putative transmembrane
domains at amino acids 233-252 of the B723P-long ORF and amino acids 198-217 of
the B723P-short ORF. SEQ ID NO:516, 518 and 519 represent amino acids 1-197,
198-243 and 218-243, respectively of B723P-short. SEQ ID NO:517 represents amino
acids 1-232 of B723P-long.

Further isolation of individual clones that are over-expressed in breast
tumor tissue was conducted using cDNA subtraction library techniques described
above. In particular, a cDNA subtraction library containing cDNA from breast tumors
subtracted with five other normal human tissue cDNAs (brain, liver, PBMC, pancreas
and normal breast) was utilized in this screening. From the original subtraction, one
hundred seventy seven clones were selected to be further characterized by DNA
sequencing and microarray analysis. Microarray analysis demonstrated that the
sequences in SEQ ID NO:182-251 and 479 were 2 or more fold over-expressed in
human breast tumor tissues over normal human tissues. No significant homologies
were found for nineteen of these clones, including SEQ ID NO:185, 186, 194, 199,
205, 208, 211, 214-216, 219, 222, 226, 232, 236, 240, 241, 245, 246 and 479, with the
exception of some previously identified expressed sequence tags (ESTs). The
remaining clones share some homology to previously identified genes, specifically SEQ
ID NO:181-184, 187-193, 195-198, 200-204, 206, 207, 209, 210, 212, 213, 217, 218,
220, 221, 223-225, 227-231, 233-235, 237-239, 242-244 and 247-251.
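
A fold over-expression call of the kind described above may be sketched as follows. This is an illustration under stated assumptions: the function names and intensity values are hypothetical, and the ratio of mean tumor intensity to mean normal-tissue intensity is used as the fold measure; the actual analysis was performed with the vendor's software, whose computation is not described in the text.

```python
from statistics import mean


def fold_over_expression(tumor_intensities, normal_intensities):
    """Ratio of mean fluorescence intensity in tumor samples to the mean
    intensity in normal tissue samples for one arrayed clone."""
    normal_mean = mean(normal_intensities)
    if normal_mean <= 0:
        raise ValueError("normal tissue intensity must be positive")
    return mean(tumor_intensities) / normal_mean


def is_over_expressed(tumor_intensities, normal_intensities, min_fold=2.0):
    """Apply the "2 or more fold" criterion recited in the text."""
    return fold_over_expression(tumor_intensities, normal_intensities) >= min_fold


# Hypothetical intensities for a single clone, for illustration only.
fold = fold_over_expression([4.0, 6.0], [2.0, 2.0])
```

A per-tissue comparison (tumor versus each normal tissue separately) would be needed to reproduce statements such as "expressed at low levels in all other tissues tested with the exception of" a named tissue.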

One of the cDNA clones isolated by PCR subtraction as described above
(SEQ ID NO:476; referred to as B720P), which was shown by microarray to be
over-expressed in breast tumor tissues, was found to be identical to a known keratin gene.
The full-length cDNA sequence of the known keratin gene is provided in SEQ ID
NO:477, with the corresponding amino acid sequence being provided in SEQ ID
NO:478. Primers were generated based on the sequence of SEQ ID NO:477 and used to
clone full-length cDNA from mRNA which was obtained from total RNA showing high
expression of B720P in real-time PCR analysis. Products were then cloned and
sequenced. The determined full-length cDNA sequence for B720P is provided in SEQ
ID NO:484, with the corresponding amino acid sequence being provided in SEQ ID
NO:485.

In further studies, a truncated form of B720P (referred to as B720P-tr)
was identified in breast carcinomas. This antigen was cloned from mRNA derived from
total breast tumor RNA that showed high expression of B720P-tr in real-time PCR
analysis. mRNA was used to generate a pool of cDNA which was then used as a
template to amplify the cDNA corresponding to B720P-tr by PCR. The determined
cDNA sequence for B720P-tr is provided in SEQ ID NO:486. B720P-tr has an ORF of
708 base pairs which encodes a 236 amino acid protein (SEQ ID NO:487). The size of
the transcript was confirmed by northern analysis.

Of the seventy clones showing over-expression in breast tumor tissues,
fifteen demonstrated particularly good expression levels in breast tumor over normal
human tissues. The following eleven clones did not show any significant homology to
any known genes. Clone 19463.1 (SEQ ID NO:185) was over-expressed in the majority
of breast tumors and also in the SCID breast tumors tested (refer to Example 2);
additionally, over-expression was found in a majority of normal breast tissues. Clone
19483.1 (SEQ ID NO:216) was over-expressed in a few breast tumors, with no
over-expression in any normal tissues tested. Clone 19470.1 (SEQ ID NO:219) was found to
be slightly over-expressed in some breast tumors. Clone 19468.1 (SEQ ID NO:222) was
found to be slightly over-expressed in the majority of breast tumors tested. Clone
19505.1 (SEQ ID NO:226) was found to be slightly over-expressed in 50% of breast
tumors, as well as in SCID tumor tissues, with some degree of over-expression found
in normal breast. Clone 1509.1 (SEQ ID NO:232) was found to be over-expressed in
very few breast tumors, but with a certain degree of over-expression in metastatic breast
tumor tissues, as well as no significant over-expression found in normal tissues. Clone
19513.1 (SEQ ID NO:236) was shown to be slightly over-expressed in few breast
tumors, with no significant over-expression levels found in normal tissues. Clone
19575.1 (SEQ ID NO:240) showed low level over-expression in some breast tumors
and also in normal breast. Clone 19560.1 (SEQ ID NO:241) was over-expressed in
50% of breast tumors tested, as well as in some normal breast tissues. Clone 19583.1
(SEQ ID NO:245) was slightly over-expressed in some breast tumors, with very low
levels of over-expression found in normal tissues. Clone 19587.1 (SEQ ID NO:246)
showed low level over-expression in some breast tumors and no significant
over-expression in normal tissues.

Clone 19520.1 (SEQ ID NO:233), showing homology to clone 102D24
on chromosome 11q13.31, was found to be over-expressed in breast tumors and in
SCID tumors. Clone 19517.1 (SEQ ID NO:237), showing homology to human PAC
128M19 clone, was found to be slightly over-expressed in the majority of breast tumors
tested. Clone 19392.2 (SEQ ID NO:247), showing homology to human chromosome
17, was shown to be over-expressed in 50% of breast tumors tested. Clone 19399.2
(SEQ ID NO:250), showing homology to human Xp22 BAC GSHB-184P14, was
shown to be slightly over-expressed in a limited number of breast tumors tested.
In subsequent studies, 64 individual clones were isolated from a
subtracted cDNA library containing cDNA from a pool of breast tumors subtracted with
cDNA from five normal tissues (brain, liver, PBMC, pancreas and normal breast). The
subtracted cDNA library was prepared as described above with the following
modification. A combination of five six-base cutters (MluI, MscI, PvuII, SalI and StuI)
was used to digest the cDNA instead of RsaI. This resulted in an increase in the average
insert size from 300 bp to 600 bp. The 64 isolated clones were colony PCR amplified
and their mRNA expression levels in breast tumor tissue, normal breast and various
other normal tissues were examined by microarray technology as described above. The
determined cDNA sequences of 11 clones which were found to be over-expressed in
breast tumor tissue are provided in SEQ ID NO:405-415. Comparison of these
sequences to those in the public database, as outlined above, revealed homologies
between the sequences of SEQ ID NO:408, 411, 413 and 414 and previously isolated
ESTs. The sequences of SEQ ID NO:405-407, 409, 410, 412 and 415 were found to
show some homology to previously identified sequences.

In further studies, a subtracted cDNA library was prepared from cDNA
from metastatic breast tumors subtracted with a pool of cDNA from five normal tissues
(breast, brain, lung, pancreas and PBMC) using the PCR-subtraction protocol of
Clontech, described above. The determined cDNA sequences of 90 clones isolated
from this library are provided in SEQ ID NO:316-404. Comparison of these sequences
with those in the public database, as described above, revealed no significant
homologies to the sequence of SEQ ID NO:366. The sequences of SEQ ID NO:321-325,
343, 354, 368, 369, 377, 382, 385, 389, 395, 397 and 400 were found to show
some homology to previously isolated ESTs. The remaining sequences were found to
show homology to previously identified gene sequences.

In yet further studies, a subtracted cDNA library (referred to as 2BT) was
prepared from cDNA from breast tumors subtracted with a pool of cDNA from six
normal tissues (liver, brain, stomach, small intestine, kidney and heart) using the
PCR-subtraction protocol of Clontech, described above. cDNA clones isolated from this
subtraction were subjected to DNA microarray analysis as described above and the
resulting data subjected to four modified Gemtools analyses. The first analysis
compared 28 breast tumors with 28 non-breast normal tissues. A mean over-expression
of at least 2.1 fold was used as a selection cut-off. The second analysis compared 6
metastatic breast tumors with 29 non-breast normal tissues. A mean over-expression of
at least 2.5 fold was used as a cut-off. The third and fourth analyses compared 2 early
SCID mouse-passaged with 2 late SCID mouse-passaged tumors. A mean
over-expression in the early or late passaged tumors of 2.0 fold or greater was used as a
cut-off. In addition, a visual analysis was performed on the microarray data for the 2BT
clones. The determined cDNA sequences of 13 clones identified in the visual analysis
are provided in SEQ ID NO:427-439. The determined cDNA sequences of 22 clones
identified using the modified Gemtools analysis are provided in SEQ ID NO:440-462,
wherein SEQ ID NO:453 and 454 represent two partial, non-overlapping, sequences of
the same clone.
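
By way of illustration only, the cut-off step of the first two analyses might be sketched in Python as follows. The function name, data layout and signal values are hypothetical. The sketch assumes that "mean over-expression" is the ratio of the mean tumor signal to the mean normal-tissue signal, which the text does not specify, and it makes no attempt to reproduce the Gemtools software.

```python
from statistics import mean


def select_overexpressed(signals, cutoff):
    """Return clone names whose mean tumor/normal signal ratio meets the cut-off.

    signals maps a clone name to (tumor_signals, normal_signals); both are
    lists of hypothetical microarray intensities.
    """
    selected = []
    for clone, (tumor, normal) in signals.items():
        # Assumed definition of fold over-expression: ratio of the two means.
        fold = mean(tumor) / mean(normal)
        if fold >= cutoff:
            selected.append(clone)
    return selected


# Hypothetical intensities for three made-up clones.
example = {
    "cloneA": ([420.0, 380.0, 400.0], [100.0, 100.0]),  # 4.0 fold
    "cloneB": ([220.0, 220.0], [100.0, 100.0]),          # 2.2 fold
    "cloneC": ([150.0, 170.0], [100.0, 100.0]),          # 1.6 fold
}

# Cut-offs taken from the text: 2.1 fold (breast tumors) and 2.5 fold (metastatic tumors).
primary_hits = select_overexpressed(example, cutoff=2.1)
metastatic_hits = select_overexpressed(example, cutoff=2.5)
```

A clone may pass one cut-off and fail another, which is consistent with the same microarray data being subjected to several separate analyses.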

Comparison of the clone sequences of SEQ ID NO:436 and 437 (referred
to as 263 06 and 262B2) with those in the public databases, as described above,
revealed no significant homologies to previously identified sequences. The sequences
of SEQ ID NO:427, 429, 431, 435, 438, 441, 443, 444, 445, 446, 450, 453 and 454
(referred to as 266B4, 266G3, 264B4, 263G1, 262B6, 2BT2-34, 2BT1-77, 2BT1-62,
2BT1-60,61, 2BT1-59, 2BT1-52 and 2BT1-40, respectively) showed some homology to
previously isolated expressed sequence tags (ESTs). The sequences of SEQ ID
NO:428, 430, 432, 433, 434, 439, 440, 442, 447, 448, 449, 451, 452 and 455-462
(referred to as clones 22892, 22890, 22883, 22882, 22880, 22869, 21374, 21349,
21093, 21091, 21089, 21085, 21084, 21063, 21062, 21060, 21053, 21050, 21036,
21037 and 21048, respectively) showed some homology to gene sequences previously
identified in humans.

EXAMPLE 2

Isolation and Characterization of Breast Tumor Polypeptides
Obtained by PCR-Based Subtraction Using SCID-Passaged Tumor RNA

Human breast tumor antigens were obtained by PCR-based subtraction
using SCID mouse passaged breast tumor RNA as follows. Human breast tumor was
implanted in SCID mice and harvested on the first or sixth serial passage, as described
in Patent Application Serial No. 08/556,659 filed 11/13/95, U.S. Patent No. 5,986,170.
Genes found to be differentially expressed between early and late passage SCID tumor
may be stage specific and therefore useful in therapeutic and diagnostic applications.
Total RNA was prepared from snap frozen SCID passaged human breast tumor from
both the first and sixth passage.

PCR-based subtraction was performed essentially as described above. In
the first subtraction (referred to as T9), RNA from first passage tumor was subtracted
from sixth passage tumor RNA to identify more aggressive, later passage-specific
antigens. Of the 64 clones isolated and sequenced from this subtraction, no significant
homologies were found to 30 of these clones, hereinafter referred to as: 13053, 13057,
13059, 13065, 13067, 13068, 13071-13073, 13075, 13078, 13079, 13081, 13082,
13092, 13097, 13101, 13102, 13131, 13133, 13119, 13135, 13139, 13140,
13146-13149, and 13151, with the exception of some previously identified expressed sequence
tags (ESTs). The determined cDNA sequences for these clones are provided in SEQ ID
NO:88-116, respectively. The isolated cDNA sequences of SEQ ID NO:117-140
showed homology to known genes.

In a second PCR-based subtraction, RNA from sixth passage tumor was
subtracted from first passage tumor RNA to identify antigens down-regulated over
multiple passages. Of the 36 clones isolated and sequenced, no significant homologies
were found to nineteen of these clones, hereinafter referred to as: 14376, 14377, 14383,
14384, 14387, 14392, 14394, 14398, 14401, 14402, 14405, 14409, 14412,
14414-14416, 14419, 14426, and 14427, with the exception of some previously identified
expressed sequence tags (ESTs). The determined cDNA sequences for these clones are
provided in SEQ ID NO:141-159, respectively. The isolated cDNA sequences of SEQ
ID NO:160-174 were found to show homology to previously known genes.

Further analysis of human breast tumor antigens through PCR-based
subtraction using first and sixth passage SCID tumor RNA was performed. Sixty three
clones were found to be differentially expressed by a two or more fold margin, as
determined by microarray analysis, i.e., higher expression in early passage tumor over
late passage tumor, or vice versa. Seventeen of these clones showed no significant
homology to any known genes, although some degree of homology with previously
identified expressed sequence tags (ESTs) was found, hereinafter referred to as 20266,
20270, 20274, 20276, 20277, 20280, 20281, 20294, 20303, 20310, 20336, 20341,
20941, 20954, 20961, 20965 and 20975 (SEQ ID NO:252-268, respectively). The
remaining clones were found to share some degree of homology to known genes, which
are identified in the Brief Description of the Drawings and Sequence Identifiers section
above, hereinafter referred to as 20261, 20262, 20265, 20267, 20268, 20271, 20272,
20273, 20278, 20279, 20293, 20300, 20305, 20306, 20307, 20313, 20317, 20318,
20320, 20321, 20322, 20326, 20333, 20335, 20337, 20338, 20340, 20938, 20939,
20940, 20942, 20943, 20944, 20946, 20947, 20948, 20949, 20950, 20951, 20952,
20957, 20959, 20966, 20976, 20977 and 20978. The determined cDNA sequences for
these clones are provided in SEQ ID NO:269-314, respectively.

The clones 20310, 20281, 20262, 20280, 20303, 20336, 20270, 20341,
20326 and 20977 (also referred to as B820P, B821P, B822P, B823P, B824P, B825P,
B826P, B827P, B828P and B829P, respectively) were selected for further analysis
based on the results obtained with microarray analysis. Specifically, microarray data
analysis indicated at least two- to three-fold overexpression of these clones in breast
tumor RNA compared to normal tissues tested. Subsequent studies led to the
determination of the complete insert sequence for the clones B820P, B821P, B822P,
B823P, B824P, B825P, B826P, B827P, B828P and B829P. These extended cDNA
sequences are provided in SEQ ID NO:416-426, respectively.

EXAMPLE 3
Synthesis of Polypeptides 
Polypeptides may be synthesized on a Perkin Elmer/Applied
Biosystems Division 430A peptide synthesizer using FMOC chemistry with HPTU
(O-Benzotriazole-N,N,N',N'-tetramethyluronium hexafluorophosphate) activation. A
Gly-Cys-Gly sequence may be attached to the amino terminus of the peptide to provide a
method of conjugation, binding to an immobilized surface, or labeling of the peptide.
Cleavage of the peptides from the solid support may be carried out using the following
cleavage mixture: trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol
(40:1:2:2:3). After cleaving for 2 hours, the peptides may be precipitated in cold
methyl-t-butyl-ether. The peptide pellets may then be dissolved in water containing
0.1% trifluoroacetic acid (TFA) and lyophilized prior to purification by C18 reverse
phase HPLC. A gradient of 0%-60% acetonitrile (containing 0.1% TFA) in water
(containing 0.1% TFA) may be used to elute the peptides. Following lyophilization of
the pure fractions, the peptides may be characterized using electrospray or other types of
mass spectrometry and by amino acid analysis.
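
As a worked illustration of the 40:1:2:2:3 cleavage mixture, the following Python sketch splits a chosen total amount into its five components. It is a sketch under stated assumptions: the function name and the 12 ml batch size are hypothetical, and the ratio is treated as parts by volume, which the text does not state.

```python
# Ratio from the text: trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol.
CLEAVAGE_RATIO = {
    "trifluoroacetic acid": 40,
    "ethanedithiol": 1,
    "thioanisole": 2,
    "water": 2,
    "phenol": 3,
}


def cleavage_amounts(total_ml):
    """Split a total amount into components according to CLEAVAGE_RATIO.

    Assumes the ratio is by volume; the source does not say whether it is
    by volume or by weight.
    """
    parts = sum(CLEAVAGE_RATIO.values())  # 48 parts in total
    return {name: total_ml * share / parts for name, share in CLEAVAGE_RATIO.items()}


# Hypothetical 12 ml batch.
amounts = cleavage_amounts(12.0)
```

For the hypothetical 12 ml batch, the 48 parts correspond to 0.25 ml each, so trifluoroacetic acid accounts for 10 ml of the mixture.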

EXAMPLE 4

Elicitation of Breast Antigen-Specific CTL Responses
in Human Blood

This Example illustrates the ability of the breast-specific antigen B726P
to elicit a cytotoxic T lymphocyte (CTL) response in peripheral blood lymphocytes from
normal humans.

Autologous dendritic cells (DC) were differentiated from monocyte
cultures derived from PBMC of a normal donor by growth for five days in RPMI
medium containing 10% human serum, 30 ng/ml GM-CSF and 30 ng/ml IL-4.
Following five days of culture, DC were infected overnight with adenovirus expressing
recombinant B726P (downstream ORF; SEQ ID NO:176) at an M.O.I. of 2.5 and
matured for 8 hours by the addition of 2 micrograms/ml CD40 ligand. CD8-positive
cells were enriched by the depletion of CD4- and CD14-positive cells. Priming
cultures were initiated in individual wells of several 96-well plates with the cytokines
IL-6 and IL-12. These cultures were restimulated in the presence of IL-2 using
autologous fibroblasts treated with IFN-gamma and transduced with B726P and CD80.
Following three stimulation cycles, the presence of B726P-specific CTL activity was
assessed in IFN-gamma Elispot assays (Lalvani et al., J. Exp. Med. 186:859-865, 1997)
using IFN-gamma treated autologous fibroblasts transduced to express either B726P or
an irrelevant, control, antigen as antigen presenting cells (APC). Of approximately 96
lines, one line (referred to as 6-2B) was identified that appeared to specifically
recognize B726P-transduced APC but not control antigen-transduced APC. This
microculture was cloned using standard protocols. B726P-specific CTL were identified
by Elispot analysis and expanded for further analysis. These CTL clones were
demonstrated to recognize B726P-expressing fibroblasts, but not the control antigen
MART-1, using chromium-51 release assays. Furthermore, using a panel of allogeneic
fibroblasts transduced with B726P in antibody blocking assays, the HLA restriction
element for these B726P-specific CTL was identified as HLA-B*1501.
In order to define more accurately the location of the epitope recognized
by the B726P-specific CTL clones, a deletion construct comprising only the N-terminal
half (a.a. 1-129) of B726P (referred to as B726Pdelta3') was constructed in the pBIB
retroviral expression plasmid. This plasmid, as well as other plasmids containing
B726P, were transfected into COS-7 cells either alone or in combination with a plasmid
expressing HLA-B*1501. Approximately 48 hours after transfection, a B726P-specific
CTL clone (1-9B) was added at approximately 10^4 cells per well. The cells were
harvested the next day and the amount of IFN-gamma released was measured by
ELISA. The CTL responded above background (EGFP) to COS-7 cells that had been
transfected with both B726P and HLA-B*1501. There was no response above
background to COS-7 cells that had been transfected with either B726P or
HLA-B*1501 alone. Importantly, a higher response was seen with COS-7 cells that had been
transfected with both HLA-B*1501 and B726Pdelta3'. This result indicated that the
epitope was likely to be located in the N-terminal region (a.a. 1-129) of B726P. This
region was examined and amino acid sequences that corresponded to the HLA-B*1501
peptide binding motif (J. Immunol. 1999, 162:7217-84) were identified and synthesized.
These peptides were pulsed at 10 ug/ml onto autologous B-LCL overnight. The next
day, the cells were washed and the ability of the cells to stimulate the B726P-specific
CTL clone 1-9B was assayed in an IFN-gamma ELISPOT assay. Of the eleven peptides
tested, only one peptide, having the amino acid sequence SLTKRASQY (a.a. 76-84 of
B726P; SEQ ID NO:488) was recognized by the CTL clone. This result identifies this
peptide as being a naturally-processed epitope recognized by this B726P-specific CTL
clone.

In further studies, a panel of breast tumor cell lines obtained from the
American Type Culture Collection (Manassas, VA), was analyzed using real time PCR
to determine their B726P message level. The cell line that expressed the highest level
of B726P (referred to as HTB21) and a line that expressed no B726P (referred to as
HTB132) were transduced with HLA-B*1501. These cell lines were grown up and
analyzed using FACS to determine their B1501 expression. The line HTB21 was
found to endogenously express B1501. To determine if clone 1-9A would recognize the
tumor cell line HTB21, an IFN-gamma ELISPOT assay was performed using 20,000 T
cells, low dose IL-2 (5 ug/ml), and 20,000 of the following targets: autologous B726P
or MART-1 fibroblasts; untransduced or B1501-transduced HTB21; or untransduced or
B1501-transduced HTB132. These were incubated overnight and the assay was
developed the next day. The results of this assay are shown in Figure 2. These studies
demonstrate that B726P-specific CTL can recognize and lyse breast tumor cells
expressing B726P.
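
The motif-based selection of candidate peptides described above can be illustrated with a short Python sketch. Everything in it is an assumption made for illustration: the anchor residues (Q or L at position 2, Y or F at the C-terminus), the fixed 9-residue length, the function name and the flanking residues of the toy sequence are hypothetical and are not taken from the cited motif reference or from the B726P sequence.

```python
# Assumed anchor preferences, for illustration only; the cited motif
# reference should be consulted for the residues actually used.
P2_ANCHORS = {"Q", "L"}
C_TERMINAL_ANCHORS = {"Y", "F"}


def motif_candidates(protein, length=9):
    """Return (1-based start, peptide) for each window matching both assumed anchors."""
    hits = []
    for start in range(len(protein) - length + 1):
        peptide = protein[start:start + length]
        if peptide[1] in P2_ANCHORS and peptide[-1] in C_TERMINAL_ANCHORS:
            hits.append((start + 1, peptide))
    return hits


# Toy sequence: the epitope reported in the text with hypothetical flanking glycines.
toy_region = "GG" + "SLTKRASQY" + "GG"
candidates = motif_candidates(toy_region)
```

A scan of this kind only nominates peptides for synthesis. As in the study above, recognition by the CTL clone still has to be established experimentally.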

EXAMPLE 5

Identification of Immunogenic CD4 T Cell Epitopes in Breast Antigens 

Immunogenic CD4 T cell epitopes derived from the breast antigen 
B726P were identified as follows. 

A total of thirty-five 20-mer peptides overlapping by 12 amino acids and
derived from the downstream ORF of B726P (corresponding to amino acids 1-317 of
SEQ ID NO:176) were generated by standard procedure. Dendritic cells (DC) were
derived from PBMC of a normal male donor using GM-CSF and IL-4 by standard
protocol. Purified CD4 T cells were generated from the same donor as the DC using
MACS beads and negative selection of PBMCs. DC were pulsed overnight with pools
of the 20-mer peptides, with each peptide at an individual concentration of 0.5
micrograms/mL. Pulsed DC were washed and plated at 10,000 cells/well of 96-well U
bottom plates, and purified CD4 T cells were added at 100,000 cells/well. Cultures
were supplemented with 10 ng/mL IL-6 and 5 ng/mL IL-12 and incubated at 37 °C.
Cultures were restimulated as above on a weekly basis using DC made and pulsed as
above as the antigen presenting cell, supplemented with 10 u/mL IL-2 and 5 ng/mL
IL-7. Following three in vitro stimulation cycles (the initial priming + two restimulations),
cell lines (each corresponding to one well) were tested for specific proliferation and
cytokine production in response to the stimulating pool versus an irrelevant pool of
peptides derived from unrelated antigens. A number of individual CD4 T cell lines
(36/672 by IFN-gamma and 64/672 by proliferation) demonstrated significant cytokine
release (IFN-gamma) and proliferation in response to the B726P peptide pools but not
to the control peptide pool. Twenty-five of these T cell lines were restimulated on the
appropriate pool of B726P peptides and reassayed on autologous DC pulsed with either
the individual peptides or recombinant B726P protein made in E. coli. Approximately
14 immunogenic peptides were recognized by the T cells from the entire set of peptide
antigens tested. The amino acid sequences of these 14 peptides are provided in SEQ ID
NO:534-547, with the corresponding DNA sequences being provided in SEQ ID
NO:520-533, respectively. In some cases the peptide reactivity of the T cell line could
be mapped to a single peptide but some could be mapped to more than one peptide in
each pool. Thirteen of the fifteen T cell lines recognized the recombinant B726P
protein. These results demonstrate that 13 of the 14 peptide sequences (SEQ ID
NO:534-542 and 544-547) may be naturally processed CD4 epitopes of the B726P
protein.
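
The design of 20-mer peptides overlapping by 12 amino acids amounts to a sliding window that advances 8 residues at a time. A minimal Python sketch follows. The function name and the toy sequence are hypothetical, and the handling of residues left over at the C-terminus (simply dropped here) is an assumption, since the text does not say how the end of the ORF was covered.

```python
def overlapping_peptides(sequence, length=20, overlap=12):
    """Return full-length windows of `length` residues sharing `overlap` residues.

    Trailing residues that do not fill a complete window are not covered;
    this end handling is an assumption of the sketch.
    """
    step = length - overlap  # 8 residues for 20-mers overlapping by 12
    if step <= 0:
        raise ValueError("overlap must be smaller than the peptide length")
    last_start = len(sequence) - length
    return [sequence[i:i + length] for i in range(0, last_start + 1, step)]


# Hypothetical 36-residue toy sequence (not B726P).
toy_sequence = ("ACDEFGHIKLMNPQRSTVWY" * 2)[:36]
peptides = overlapping_peptides(toy_sequence)
```

Each peptide shares its last 12 residues with the first 12 residues of the next one, so any epitope of 13 residues or fewer that lies within the covered region is contained intact in at least one peptide.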

EXAMPLE 6 

Preparation and Characterization of Antibodies against
Breast Tumor Antigen B726P

Polyclonal antibodies against both the downstream (SEQ ID NO:176)
and upstream (SEQ ID NO:469) ORF of the breast tumor antigen B726P were prepared
as follows.

The downstream or upstream ORF of B726P expressed in an E. coli
recombinant expression system was grown overnight in LB broth with the appropriate
antibiotics at 37 °C in a shaking incubator. The next morning, 10 ml of the overnight
culture was added to 500 ml of 2x YT plus appropriate antibiotics in a 2L-baffled
Erlenmeyer flask. When the optical density (at 560 nm) of the culture reached 0.4-0.6,
the cells were induced with IPTG (1 mM). Four hours after induction with IPTG, the
cells were harvested by centrifugation. The cells were then washed with phosphate
buffered saline and centrifuged again. The supernatant was discarded and the cells were
either frozen for future use or immediately processed. Twenty ml of lysis buffer was
added to the cell pellets and vortexed. To break open the E. coli cells, this mixture was
then run through the French Press at a pressure of 16,000 psi. The cells were then
centrifuged again and the supernatant and pellet were checked by SDS-PAGE for the
partitioning of the recombinant protein. For proteins that localized to the cell pellet, the
pellet was resuspended in 10 mM Tris pH 8.0, 1% CHAPS and the inclusion body pellet
was washed and centrifuged again. This procedure was repeated twice more. The
washed inclusion body pellet was solubilized with either 8 M urea or 6 M guanidine
HCl containing 10 mM Tris pH 8.0 plus 10 mM imidazole. The solubilized protein was
added to 5 ml of nickel-chelate resin (Qiagen) and incubated for 45 min to 1 hour at
room temperature with continuous agitation. After incubation, the resin and protein
mixture were poured through a disposable column and the flow through was collected.
The column was then washed with 10-20 column volumes of the solubilization buffer.
The antigen was then eluted from the column using 8 M urea, 10 mM Tris pH 8.0 and
300 mM imidazole and collected in 3 ml fractions. An SDS-PAGE gel was run to
determine which fractions to pool for further purification.

As a final purification step, a strong anion exchange resin, such as
HiPrepQ (Biorad), was equilibrated with the appropriate buffer and the pooled fractions
from above were loaded onto the column. Antigen was eluted off the column with an
increasing salt gradient. Fractions were collected as the column was run and another
SDS-PAGE gel was run to determine which fractions from the column to pool. The
pooled fractions were dialyzed against 10 mM Tris pH 8.0. The protein was then vialed
after filtration through a 0.22 micron filter and the antigens were frozen until needed for
immunization.

Four hundred micrograms of the B726P antigen was combined with 100
micrograms of muramyldipeptide (MDP). Every four weeks rabbits were boosted with
100 micrograms mixed with an equal volume of Incomplete Freund's Adjuvant (IFA).
Seven days following each boost, the animal was bled. Sera were generated by
incubating the blood at 4 °C for 12-24 hours followed by centrifugation.

Ninety-six well plates were coated with B726P antigen by incubating
with 50 microliters (typically 1 microgram) of recombinant protein at 4 °C for 20 hours.
250 microliters of BSA blocking buffer was added to the wells and incubated at room
temperature for 2 hours. Plates were washed 6 times with PBS/0.01% Tween. Rabbit
sera were diluted in PBS. Fifty microliters of diluted sera was added to each well and
incubated at room temperature for 30 min. Plates were washed as described above
before 50 microliters of goat anti-rabbit horse radish peroxidase (HRP) at a 1:10000
dilution was added and incubated at room temperature for 30 min. Plates were again
washed as described above and 100 microliters of TMB microwell peroxidase substrate
was added to each well. Following a 15 min incubation in the dark at room
temperature, the colorimetric reaction was stopped with 100 microliters of 1N H2SO4
and read immediately at 450 nm. All the polyclonal antibodies showed
immunoreactivity to the appropriate B726P antigen.

Preparation of polyclonal antibodies against B709P and B720P

The breast antigens B709P (SEQ ID NO:62) and B720P (SEQ ID
NO:485) expressed in an E. coli recombinant expression system were grown overnight in
LB Broth with the appropriate antibiotics at 37°C in a shaking incubator. Ten ml of the
overnight culture was added to 500 ml of 2x YT plus appropriate antibiotics in a
2L-baffled Erlenmeyer flask. When the optical density (at 560 nanometers) of the culture
reached 0.4-0.6, the cells were induced with IPTG (1 mM). Four hours after induction
with IPTG, the cells were harvested by centrifugation. The cells were washed with
phosphate buffered saline and centrifuged again. The supernatant was discarded and the
cells were either frozen for future use or immediately processed. Twenty milliliters of
lysis buffer was added to the cell pellets and vortexed. To break open the E. coli cells,
the mixture was run through a French Press at a pressure of 16,000 psi. The cells were
centrifuged again and the supernatant and pellet were checked by SDS-PAGE for the
partitioning of the recombinant protein. For proteins that localized to the cell pellet, the
pellet was resuspended in 10 mM Tris pH 8.0, 1% CHAPS and the inclusion body pellet
was washed and centrifuged again. This procedure was repeated twice more. The
washed inclusion body pellet was solubilized with either 8 M urea or 6 M guanidine
HCl containing 10 mM Tris pH 8.0 plus 10 mM imidazole. The solubilized protein was
added to 5 ml of nickel-chelate resin (Qiagen) and incubated for 45 min to 1 hour at
room temperature (RT) with continuous agitation. After incubation, the resin and
protein mixture were poured through a disposable column and the flow through was
collected. The column was then washed with 10-20 column volumes of the
solubilization buffer. The antigen was then eluted from the column using 8 M urea, 10
mM Tris pH 8.0 and 300 mM imidazole and collected in 3 ml fractions. An SDS-PAGE
gel was run to determine which fractions to pool for further purification. As a final
purification step, a strong anion exchange resin such as Hi-Prep Q (Biorad) was
equilibrated with the appropriate buffer and the pooled fractions from above were
loaded onto the column. Each antigen was eluted off of the column with an increasing
salt gradient. Fractions were collected as the column was run and another SDS-PAGE
gel was run to determine which fractions from the column to pool. The pooled fractions
were dialyzed against 10 mM Tris pH 8.0. The proteins were then vialed after filtration
through a 0.22-micron filter and frozen until needed for immunization.

Four hundred micrograms of antigen was combined with 100
micrograms of muramyldipeptide (MDP). An equal volume of Incomplete Freund's
Adjuvant (IFA) was added and mixed, and the mixture was injected into a rabbit. The
rabbit was boosted with 100 micrograms of antigen mixed with an equal volume of IFA
every four weeks. The animal was bled seven days following each boost. Sera were
generated by incubating the blood at 4°C for 12-24 hours followed by centrifugation.

The reactivity of the polyclonal antibodies to recombinant antigen
(B709P or B720P) was determined by ELISA as follows. Ninety-six well plates were
coated with antigen by incubating with 50 microliters (typically 1 microgram) at 4°C for
20 hrs. 250 microliters of BSA blocking buffer was added to the wells and incubated at
RT for 2 hrs. Plates were washed 6 times with PBS/0.01% Tween. Rabbit sera were
diluted in PBS. Fifty microliters of diluted sera was added to each well and incubated at
RT for 30 min. Plates were washed as described above before 50 microliters of goat
anti-rabbit horse radish peroxidase (HRP) at a 1:10000 dilution was added and
incubated at RT for 30 min. Plates were washed as described above and 100 microliters
of TMB Microwell Peroxidase Substrate was added to each well. Following a
15-minute incubation in the dark at RT, the colorimetric reaction was stopped with 100
microliters of 1N H2SO4 and read immediately at 450 nm. The polyclonal antibodies
showed immunoreactivity to the appropriate antigen.

EXAMPLE 7

PROTEIN EXPRESSION OF BREAST TUMOR ANTIGENS

The downstream ORF of B726P (SEQ ID NO:176), together with a
C-terminal 6X His Tag, was expressed in insect cells using the baculovirus expression
system as follows.

The cDNA for the full-length downstream ORF of B726P was PCR
amplified using the primers of SEQ ID NO:480 and 481. The PCR product with the
expected size was recovered from agarose gel, restriction digested with EcoRI and
HindIII, and ligated into the transfer plasmid pFastBac1, which was digested with the same
restriction enzymes. The sequence of the insert was confirmed by DNA sequencing.
The recombinant transfer plasmid pFBB726P was used to make recombinant bacmid
DNA and virus using the Bac-To-Bac Baculovirus expression system (BRL Life
Technologies, Gaithersburg, MD). High Five cells were infected with the recombinant
virus BVB726P to produce protein. The cDNA and amino acid sequences of the
expressed B726P recombinant protein are provided in SEQ ID NO:482 and 483,
respectively.

EXAMPLE 8

Generation of Constructs for Protein Expression of B726P in E. coli

Three different open reading frames (ORFs) of B726P were subcloned
into pPDM, a modified pET28 vector for expression in E. coli.

Construct for the expression of B726P Upstream ORF in E. coli
(cDNA: SEQ ID NO:549; amino acid: SEQ ID NO:552):

The partial B726P upstream ORF (A) from clone 23113 was PCR
amplified with the following primers:
PDM-416 (SEQ ID NO:554) 5' gtcggctccatgagtcccgcaaaag 3' Tm 63°C.
PDM-431 (SEQ ID NO:555) 5' cgagaattcaatacttaagaagaccatctttaccag 3' Tm 61°C.

The amplification conditions were as follows: 10 µl 10X Pfu buffer, 1 µl
10 µM dNTPs, 2 µl 10 µM each oligo, 83 µl sterile water, 1.5 µl Pfu DNA polymerase
(Stratagene, La Jolla, CA), 1 µl PCR 23113. The reaction was first denatured for 2
minutes at 96°C, followed by 40 cycles of 96°C for 20 seconds, 62°C for 15 seconds,
and extension at 72°C for 2 minutes. This was followed by a final extension of 72°C for
4 minutes.
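
As a worked example of the cycling program above, the following Python sketch adds up the programmed hold times. The function name is hypothetical, and the figure excludes the time the thermal cycler spends ramping between temperatures, which depends on the instrument and is not given in the text.

```python
def pcr_hold_time_seconds(initial_denature_s, cycles, step_times_s, final_extension_s):
    """Sum the programmed hold times of a PCR run, ignoring ramp times."""
    return initial_denature_s + cycles * sum(step_times_s) + final_extension_s


# Values from the text: 2 min at 96°C; 40 cycles of 20 s, 15 s and 2 min; 4 min final extension.
total_s = pcr_hold_time_seconds(
    initial_denature_s=2 * 60,
    cycles=40,
    step_times_s=(20, 15, 2 * 60),
    final_extension_s=4 * 60,
)
total_min = total_s / 60
```

Each cycle holds for 155 seconds, so the 40 cycles contribute 6,200 seconds and the whole program holds for 6,560 seconds, a little under 110 minutes before ramp times are added.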

The second partial B726P upstream ORF (B) from clone 19310 was PCR
amplified with the following primers:
PDM-432 (SEQ ID NO:556) 5' cataagcttaaggctaactgcggaatgaaag 3' Tm 63°C.
PDM-427 (SEQ ID NO:557) 5' cccgcagaattcaacatgcaattttcatgtaagag 3' Tm 62°C.

The amplification and cycling conditions were as described above. The
first PCR product was digested with EcoRI and cloned into pPDM His (a modified
pET28 vector) that had been digested with EcoRI and Eco72I. The second PCR product
was digested with BfrI and EcoRI and cloned into the resulting construct: pPDM B726P
UP-A-5 at the EcoRI and BfrI sites. The construct (pPDM B726P Up-4) was confirmed
to be correct through sequence analysis and transformed into BL21 (DE3) pLys S and
BL21 CodonPlus RIL (DE3) cells. Protein expression was confirmed by Coomassie
stained SDS-PAGE and N-terminal protein sequence analysis.

Construct for B726P D-ORF expression in E. coli (cDNA: SEQ ID
NO:550; amino acid: SEQ ID NO:551):

The B726P D-ORF was PCR amplified with the following primers:
PDM-290 (SEQ ID NO:558) 5' ctaaatgccggcacaagagctctgc 3' Tm 61°C
PDM-291 (SEQ ID NO:559) 5' cgcgcagaattctattatataacttctgtttctgc 3' Tm 61°C

The reaction conditions were as described. The cycling conditions were
altered slightly in that the annealing temperature was lowered to 61°C from 62°C and
was held for 15 seconds. The extension time was also increased to 2 minutes and 15
seconds. The PCR product was digested with NaeI and EcoRI and cloned into pPDM
His which had been digested with Eco72I and EcoRI. The construct was confirmed by
sequencing and then transformed into BL21 (DE3) pLys S cells (Novagen, Madison,
WI). Protein expression was confirmed by Coomassie stained SDS-PAGE and
N-terminal protein sequence analysis.

Construct for B726P Combined ORF expression in E. coli (cDNA: 
SEQ ID NO:548; amino acid: SEQ ID NO:553): 

The B726P C-1 coding region was PGR amplified including the 183 bp 
insert, with the foUowmg primers: 
10 PDM-750 (SEQ ID NO:560) 5' ggggaattgtgagcggataacaattc 3' Tm 58^C 

PDM-752 (SEQ ID NO:561) 5' cgtagaattcaacctgatttaaattactttctacac 3' 

Tm 59*^0 

The B726P Downstream ORF was PGR amplified with the followmg 

primers: 

15 PDM-753 (SEQ ID NO:562) 5' gaaagtaatttaaatcaggtttctcacactc 3' Tm 

59°G 

PDM-751 (SEQ ID NO:563) 5' gaggccccaaggggttatgctag 3' Tm 61**C 
The reaction conditions for these PCR reactions were the same as
described above. The cycling conditions were as follows: 1st PCR: The reaction was
first denatured for 2 minutes at 96°C, followed by 40 cycles of 96°C for 20 seconds,
58°C for 15 seconds, and extension at 72°C for 4 minutes. This was followed by a final
extension of 72°C for 4 minutes; 2nd PCR: The reaction was first denatured for 2
minutes at 96°C, followed by 40 cycles of 96°C for 20 seconds, 59°C for 15 seconds,
and extension at 72°C for 2 minutes. This was followed by a final extension of 72°C for
4 minutes. The first PCR product was digested with EcoRI and cloned into pPDM His
(a modified pET28 vector) at the Eco72I and EcoRI sites. The construct was confirmed
to be correct through sequence analysis. The second PCR product was digested with
EcoRI and cloned into pPDM His at the same sites. The resulting constructs, pPDM
B726P UA-8 and pPDM B726P DA-7 respectively, were digested with SwaI and EcoRI.
The pPDM B726P UA-8 construct was used as vector and the insert from pPDM
B726P DA-7 was cloned into this construct successfully. The construct was confirmed
to be correct through sequence analysis and then transformed into BLR (DE3) pLys S
and HMS 174 (DE3) pLys S cells (Novagen, Madison, WI). Protein expression was
confirmed by Coomassie stained SDS-PAGE and N-terminal protein sequence analysis.
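
The cloning steps above depend on restriction sites carried in the primers. As an illustrative check only (not part of the disclosed method), the following Python sketch scans the primer sequences quoted in this Example for the recognition sites of the enzymes named in the text. The recognition sequences are the standard ones for EcoRI, NaeI and SwaI; the helper name sites_in is hypothetical.

```python
# Illustrative sketch: locate restriction sites in the primers quoted above.
# The recognition sequences are the standard ones for these enzymes; all three
# are palindromic, so scanning one strand is sufficient.
SITES = {
    "EcoRI": "gaattc",
    "NaeI": "gccggc",
    "SwaI": "atttaaat",
}

# Primer sequences copied from the text (5' to 3').
PRIMERS = {
    "PDM-290": "ctaaatgccggcacaagagctctgc",
    "PDM-291": "cgcgcagaattctattatataacttctgtttctgc",
    "PDM-752": "cgtagaattcaacctgatttaaattactttctacac",
    "PDM-753": "gaaagtaatttaaatcaggtttctcacactc",
}


def sites_in(primer: str) -> list[str]:
    """Return the sorted names of the enzymes whose site occurs in the primer."""
    seq = primer.lower()
    return sorted(name for name, site in SITES.items() if site in seq)


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, sites_in(seq))
```

Under these assumptions, PDM-290 carries the NaeI site and PDM-291 the EcoRI site used for the D-ORF construct, while PDM-752 and PDM-753 both carry the SwaI site used to join the upstream and downstream fragments.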

EXAMPLE 9

Additional Sequence Identified for Breast Tumor Antigen B726P by
Bioinformatic Analysis

The combined ORF of the breast tumor antigen, B726P (amino acid
sequence set forth in SEQ ID NO:475), was used to search public databases. A
sequence essentially identical to the combined ORF with additional N-terminal
sequence was identified in the GenBank nonredundant protein database and the cDNA
and predicted amino acid sequences are set forth in SEQ ID NO:564 and 565,
respectively. The gene is also referred to as NY-BR-1 and was described
in Cancer Research 61(5):2055-2061, March 1, 2001.

EXAMPLE 10

Analysis of B726P Expression Using Immunohistochemistry

Affinity purified polyclonal antibodies anti-B726Pup (generated against
the B726P upstream ORF protein) and anti-B726Pdown (generated against the B726P
downstream ORF) were used to assess B726P protein expression in breast cancer and in
a variety of normal tissue sections.

In order to determine which tissues express the breast cancer antigen
protein B726P, immunohistochemistry (IHC) analysis was performed on a diverse range
of tissue sections. Tissue samples were fixed in formalin solution for 12-24 hrs and
embedded in paraffin before being sliced into 8 micron sections. Steam heat induced
epitope retrieval (SHIER) in 0.1 M sodium citrate buffer (pH 6.0) was used for optimal
staining conditions. Sections were incubated with 10% serum/PBS for 5 minutes.
Primary antibody (either rabbit affinity purified anti-B726Pdown or anti-B726Pup) was
added to each section for 25 minutes followed by a 25 minute incubation with anti-rabbit
biotinylated antibody. Endogenous peroxidase activity was blocked by three 1.5 minute
incubations with hydrogen peroxide. The avidin biotin complex/horse radish
peroxidase (ABC/HRP) system was used along with DAB chromogen to visualize
antigen expression. Slides were counterstained with hematoxylin to visualize cell
nuclei. Anti-B726Pup and anti-B726Pdown immunoreactivity was observed in about
30-40% of breast cancer samples analyzed but not observed in a majority of various
normal tissues. Anti-B726Pdown and anti-B726Pup also stained roughly the same
breast cancer samples. Thus, these data confirm earlier microarray analysis (see
Example 1) showing that B726P is overexpressed in breast tumor tissue as compared to
normal tissue. Therefore, this antigen may be used in diagnostic and
immunotherapeutic applications for breast cancer.

EXAMPLE 11

Generation of monoclonal antibodies to B726P Downstream and Upstream
ORFs

Production and purification of protein used for antibody generation.

B726P upstream ORF and B726P downstream ORF proteins were expressed in an E. coli
recombinant expression system (see Example 8). Cells were grown overnight in LB
Broth with the appropriate antibiotics at 37°C in a shaking incubator. The next
morning, 10 ml of the overnight culture was added to 500 ml of 2x YT plus appropriate
antibiotics in a 2L-baffled Erlenmeyer flask. When the optical density (at 560
nanometers) of the culture reached 0.4-0.6 the cells were induced with IPTG (1 mM).
Four hours after induction with IPTG the cells were harvested by centrifugation. The
cells were then washed with phosphate buffered saline and centrifuged again. The
supernatant was discarded and the cells were either frozen for future use or immediately
processed. Twenty milliliters of lysis buffer was added to the cell pellets and vortexed.
To break open the E. coli cells, this mixture was then run through a French press at a
pressure of 16,000 psi. The cells were then centrifuged again and the supernatant and
pellet were checked by SDS-PAGE for the partitioning of the recombinant protein. For
proteins that localized to the cell pellet, the pellet was resuspended in 10 mM Tris pH
8.0, 1% CHAPS and the inclusion body pellet was washed and centrifuged again. This
procedure was repeated twice more. The washed inclusion body pellet was solubilized
with either 8 M urea or 6 M guanidine HCl containing 10 mM Tris pH 8.0 plus 10 mM
imidazole.

The solubilized protein was added to 5 ml of nickel-chelate resin
(Qiagen, Valencia, CA) and incubated for 45 min to 1 hour at room temperature with
continuous agitation. After incubation, the resin and protein mixture was poured
through a disposable column and the flow through was collected. The column was then
washed with 10-20 column volumes of the solubilization buffer. The antigen was then
eluted from the column using 8 M urea, 10 mM Tris pH 8.0 and 300 mM imidazole and
collected in 3 ml fractions. An SDS-PAGE gel was run to determine which fractions to
pool for further purification. As a final purification step, a strong anion exchange resin
such as Hi-Prep Q (Biorad) was equilibrated with the appropriate buffer and the pooled
fractions from above were loaded onto the column. Each antigen was eluted off of the
column with an increasing salt gradient. Fractions were collected as the column was
run and another SDS-PAGE gel was run to determine which fractions from the column
to pool. The pooled fractions were dialyzed against 10 mM Tris pH 8.0. This material
was then submitted to Quality Control for final release. The release criteria were purity
as determined by SDS-PAGE or HPLC, concentration as determined by Lowry assay or
Amino Acid Analysis, identity as determined by amino terminal protein sequence, and
endotoxin level as determined by the Limulus (LAL) assay. The proteins were then
vialed after filtration through a 0.22-micron filter and the antigens were frozen until
needed for immunization.

To generate anti-B726P mouse monoclonal antibodies, mice were
immunized IP with 50 micrograms of recombinant B726P upstream ORF and B726P
downstream ORF proteins that had been mixed to form an emulsion with an equal
volume of Complete Freund's Adjuvant (CFA). Every three weeks animals were
injected IP with 50 micrograms of recombinant B726P upstream ORF and B726P
downstream ORF that had been mixed with an equal volume of IFA to form an
emulsion. After the fourth injection, spleens were isolated and standard hybridoma
fusion procedures were used to generate anti-B726P mouse monoclonal antibody
hybridomas. Anti-B726P monoclonal antibodies were screened by ELISA
analysis using the bacterially expressed recombinant B726P upstream ORF and B726P
downstream ORF proteins.

A list of the mouse anti-B726P monoclonal antibodies that were
generated, as well as their anti-B726P reactivity in an ELISA assay and Western blot,
is shown in Table 2. The hybridomas were then subcloned and the subclones further
tested for reactivity with B726P upstream ORF and B726P downstream ORF proteins.
Several monoclonal antibodies showed particularly favorable reactivity: 220A2-21,
220A19-25, 220A94-29, 220A151-33.

For Western blot analysis, recombinant B726P upstream ORF and
B726P downstream ORF proteins were diluted with SDS-PAGE loading buffer
containing beta-mercaptoethanol, then boiled for 10 minutes prior to loading the
SDS-PAGE gel. Protein was transferred to nitrocellulose and probed with each of the
anti-B726P hybridoma supernatants. Protein A-HRP was used to visualize the anti-B726P
reactive bands by incubation in ECL substrate.



Table 2: B726Pup and B726Pdown Monoclonal Antibody Reactivity

                          ELISA                    Western Blots
Anti-B726P mAbs   B726PDown  B726PUp  L523S     B726Pdown  B726Pup
220A2             +++        +                  +++        ++
220A10                                          N/A        N/A
220A14            +++        +++      +++       +++        ++
220A19            ++                            ++         +
220A43            +++        +                  +++        ++
220A86            +++        +                  +++        ++
220A94            +++                                      +/-
220A123           ++                            +
220A139           +/-                           +
220A140                                         N/A        N/A
220A141                                         N/A        N/A
220A143                                         N/A        N/A
220A151           ++                            ++
220A176           +/-                           +



EXAMPLE 12

Identification of additional sequences for B726P

Additional 5' sequence was obtained for B726P. This sequence was
obtained by PCR from 1st strand cDNA prepared from three separate mRNA sources
(metastatic breast tumor, breast tumor, normal testis). Disclosed herein are clones that
were isolated, each with differences from the expected published sequence of NY-BR-1.

A 1300 bp fragment of B726P, otherwise known as NY-BR-1, was PCR
amplified from 1st strand cDNA and cloned into pPDM, a modified pET28 vector, as
follows:

The B726P XB coding region (NY-BR-1) was PCR amplified with the
following primers:

PDM-784 5' cacacaaagaggaagaagaccatc 3' Tm 56°C
PDM-814 5' gattcttttgtaggacatgcaatcatc 3' Tm 55°C

The following PCR conditions were used: 10 µl 10X Herculase buffer,
1 µl 10 mM dNTPs, 2 µl 10 µM each oligo, 83 µl sterile water, 1.5 µl Herculase DNA
polymerase (Stratagene, La Jolla, CA), 50 ng DNA. The thermalcycling conditions
were as follows:

98°C 3 minutes
98°C 40 seconds, 51°C 15 seconds, 72°C 4 minutes, X 10 cycles
98°C 40 seconds, 51°C 15 seconds, 72°C 5 minutes, X 10 cycles
98°C 40 seconds, 51°C 15 seconds, 72°C 6 minutes, X 10 cycles
98°C 40 seconds, 51°C 15 seconds, 72°C 7 minutes, X 10 cycles
72°C 10 minutes
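
For orientation only, the reaction volume and programmed hold time implied by the conditions above can be tallied as in the following Python sketch. It assumes that "each oligo" means two primers at 2 µl apiece, ignores the volume of the 50 ng DNA template, and ignores instrument ramp times; the constant and function names are hypothetical.

```python
# Illustrative tally of the Example 12 PCR set-up and cycling program.
# Assumptions (not stated in the text): two oligos at 2 µl each, the DNA
# template volume is ignored, and ramp times between steps are ignored.
REACTION_UL = {
    "10X Herculase buffer": 10.0,
    "10 mM dNTPs": 1.0,
    "oligo 1 (10 µM)": 2.0,
    "oligo 2 (10 µM)": 2.0,
    "sterile water": 83.0,
    "Herculase DNA polymerase": 1.5,
}

INITIAL_DENATURATION_S = 3 * 60     # 98°C, 3 minutes
FINAL_EXTENSION_S = 10 * 60         # 72°C, 10 minutes
DENATURE_S = 40                     # 98°C per cycle
ANNEAL_S = 15                       # 51°C per cycle
EXTENSION_MIN_PER_BLOCK = [4, 5, 6, 7]  # 72°C, lengthened every 10 cycles
CYCLES_PER_BLOCK = 10


def total_hold_seconds() -> int:
    """Sum of all programmed hold times, in seconds."""
    cycling = sum(
        CYCLES_PER_BLOCK * (DENATURE_S + ANNEAL_S + minutes * 60)
        for minutes in EXTENSION_MIN_PER_BLOCK
    )
    return INITIAL_DENATURATION_S + cycling + FINAL_EXTENSION_S


if __name__ == "__main__":
    print("reaction volume (µl):", sum(REACTION_UL.values()))
    print("programmed time (h):", round(total_hold_seconds() / 3600, 2))
```

Under these assumptions the listed components total 99.5 µl and the 40-cycle program holds for 16,180 seconds, or roughly four and a half hours.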

The PCR product was ligated into the pPDM vector (a modified pET28)
that had been digested with Eco72I and de-phosphorylated. PCR amplification of this
gene proved very difficult and required the use of a polymerase lacking proofreading
capabilities. However, use of such an enzyme, in this case, Herculase from Stratagene
(La Jolla, CA), led to what are likely PCR errors in the resulting clones. The cDNA
sequences of three of the isolated clones containing mutations are disclosed in SEQ ID
NO:567-569 with the corresponding amino acid sequences disclosed in SEQ ID
NO:572, 571, and 570, respectively.

The resulting construct, pPDM B726P XB (clone 83686), was then
digested with BglII and the insert which dropped out from the 5' vector BglII site and
the internal BglII site at amino acids 390-391 was cloned into the pPDM B726P C-ORF
(SEQ ID NO:548) that had been digested with BglII and was de-phosphorylated. This
construct, pPDM B726P XC, was then DNA sequenced and showed two nucleotide
changes, which result in two amino acid changes. The cDNA of the full-length clone
containing these 2 mutations is disclosed in SEQ ID NO:566 with the corresponding
amino acid sequence in SEQ ID NO:573. The full-length expected, published NY-BR-1
is disclosed in SEQ ID NO:564 (cDNA); amino acid SEQ ID NO:565.

EXAMPLE 13

Isolation of additional 3' sequence and real-time PCR analysis of B726P
homolog NY-BR-1.1

A sequence homolog to the breast candidate B726P, called NY-BR-1.1,
was identified and published in Cancer Research 61(5):2055-2061; March 1, 2001. The
NY-BR-1.1 gene, thought to be located on chromosome 9 based on 100% sequence
identity to genomic sequence from chromosome 9, was shown to be expressed as
mRNA in breast tumors as well as in normal brain. However, the published sequence
was lacking 3' sequence. Published incomplete sequence for NY-BR-1.1 is represented
by GenBank accession number AF269088. A recent BlastN search of the GenBank
High Throughput Genomic Sequence database using NY-BR-1.1 as a query sequence
showed a 100% match to the working draft sequence of chromosome 9 (GenBank
accession number AL359312), yielding further 3' DNA sequence for NY-BR-1.1. The
compilation of the NY-BR-1.1 sequence with the additional 3' sequence from
chromosome 9 yielded a 3720 bp ORF sequence (SEQ ID NO:576) which encodes a
1240 amino acid protein sequence (SEQ ID NO:577).
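
The stated ORF and protein lengths are consistent with three nucleotides per codon. A minimal Python check of that arithmetic follows; the function name is hypothetical, and whether the 3720 bp figure includes a stop codon is not stated in the text.

```python
# Illustrative arithmetic check: an ORF of N base pairs spans N / 3 codons.
# Whether the 3720 bp figure includes a stop codon is not stated in the text.
def codon_count(orf_bp: int) -> int:
    """Return the number of complete codons in an ORF of the given length."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_bp // 3


if __name__ == "__main__":
    print(codon_count(3720))
```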
Real time PCR primers were designed to a unique region of NY-BR-1.1
to distinguish its mRNA expression profile from B726P. This experiment represents
relative values, as it was done without template. The first-strand cDNA used in the
quantitative real-time PCR was synthesized from 20 µg of total RNA that was treated
with DNase I (Amplification Grade, Gibco BRL Life Technology, Gaithersburg, MD),
using Superscript Reverse Transcriptase (RT) (Gibco BRL Life Technology,
Gaithersburg, MD). Real-time PCR was performed with a GeneAmp™ 5700 sequence
detection system (PE Biosystems, Foster City, CA). The 5700 system uses SYBR™
green, a fluorescent dye that only intercalates into double stranded DNA, and a set of
gene-specific forward and reverse primers. The increase in fluorescence was monitored
during the whole amplification process. The optimal concentration of primers was
determined using a checkerboard approach and a pool of cDNAs from tumors was used
in this process. The PCR reaction was performed in 25 µl volumes that included 2.5 µl
of SYBR green buffer, 2 µl of cDNA template and 2.5 µl each of the forward and
reverse primers for the gene of interest. The cDNAs used for quantitative real time
PCR reactions were diluted 1:10 for each gene of interest and 1:100 for the β-actin
control. Levels of mRNA were expressed relative to ureter where NY-BR-1.1
expression was not observed when compared to the β-actin control.
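
The text does not state the formula used to express mRNA levels relative to the reference tissue and the β-actin control. One widely used convention for SYBR green data is the comparative threshold cycle (2^-ΔΔCt) calculation, sketched below in Python purely as an assumption. The function name and all Ct values are hypothetical, equal amplification efficiencies are assumed, and the different template dilutions (1:10 and 1:100) are not modeled.

```python
# Illustrative sketch of the comparative Ct (2^-ddCt) convention. This is an
# assumed calculation, not the method disclosed in the text. All Ct values in
# the usage example are hypothetical.
def relative_expression(
    ct_target_sample: float,
    ct_control_sample: float,
    ct_target_reference: float,
    ct_control_reference: float,
) -> float:
    """Fold expression of the target gene in a sample versus a reference tissue,
    each normalized to a control gene (for example, beta-actin)."""
    delta_sample = ct_target_sample - ct_control_sample
    delta_reference = ct_target_reference - ct_control_reference
    return 2.0 ** -(delta_sample - delta_reference)


if __name__ == "__main__":
    # Hypothetical Ct values: tumor sample versus reference tissue.
    print(relative_expression(25.0, 18.0, 30.0, 18.0))
```

Where the reference tissue shows no detectable signal, as reported for ureter, the reference Ct in such a calculation would typically be a late or cutoff cycle, so the resulting fold values are best read as relative rather than absolute.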

The real time PCR results show that mRNA expression for NY-BR-1.1
is present in breast tumors as well as in normal adrenal gland, brain, retina and testis.

EXAMPLE 14

Characterization of B726P monoclonal and purified polyclonal antibody
Epitopes

Mouse monoclonal antibodies and rabbit polyclonal sera were raised
against E. coli derived B726P recombinant protein and tested by ELISA, as described in
further detail below, for antibody epitope recognition against overlapping 20 mer
peptides that correspond to the amino acid sequence of the downstream ORF of B726P
(B726P dORF, set forth in SEQ ID NO:176, encoded by SEQ ID NO:175). Numerous
peptides were recognized by the monoclonal and polyclonal antibodies. The
corresponding amino acid sequences of these peptide antibody epitopes are summarized
in Table 3 and are set forth in SEQ ID NO:578-593.

ELISA ANALYSIS: B726P recombinant protein and peptides were
coated onto 96 well ELISA plates: 50 µl/well at 2 µg/ml for 20 hrs at 4°C. Plates were
then washed 5 times with PBS + 0.1% Tween 20 and blocked with PBS + 1% BSA for
2 hr. Affinity purified B726P polyclonal antibodies were then added to the wells at
1 µg/ml and B726P monoclonal supernatants were added neat (220A43 and 220A86
were diluted 1/60 and 1/20 respectively). Plates were incubated at room temperature for
30 minutes and then washed again as above, followed by the addition of 50 µl/well
donkey anti-mouse-Ig-HRP antibody for 30 minutes at room temperature. Plates were
washed, then developed by the addition of 100 µl/well of TMB substrate. The reaction
was incubated 15 minutes in the dark at room temperature and then stopped by the
addition of 100 µl/well of 1 N H2SO4. Plates were read at OD450 in an automated plate
reader. Peptides with OD450 readings three times background or above were
considered to be positively recognized by the corresponding antibody.
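
The positivity criterion above (OD450 at least three times background) can be expressed as in the following Python sketch. The function name and the OD450 readings in the usage example are hypothetical and serve only to illustrate the rule.

```python
# Illustrative sketch of the "three times background or above" rule.
# The readings used in the example are hypothetical, not data from the text.
def positive_peptides(
    od450: dict[str, float], background: float, fold: float = 3.0
) -> list[str]:
    """Return peptides whose OD450 is at least `fold` times the background."""
    cutoff = fold * background
    return [peptide for peptide, reading in od450.items() if reading >= cutoff]


if __name__ == "__main__":
    readings = {"1-20": 0.92, "9-28": 0.11, "17-36": 0.35}  # hypothetical
    print(positive_peptides(readings, background=0.10))
```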



Table 3: Peptides recognized by B726P Antibodies

Antibody                               B726P peptides (amino acids)
B726P Monoclonal Supernatant
  220A2.1                              289-308
  220A19.1                             225-244, 232-252
  220A94.1                             73-252
  220A151.1                            145-164, 153-172
  220A43                               232-252
  220A86                               145-164, 153-172
B726P Purified Polyclonal (1 µg/ml)    1-20, 9-28, 17-36, 24-44, 97-116,
                                       105-124, 113-132, 121-140, 129-148,
                                       137-156

EXAMPLE 15

Analysis of autoantibodies to B726P in breast cancer sera and epitope
Mapping of the Antigenic Sites

Specific B726P peptide epitopes were identified that react with
autoantibodies in the serum of breast cancer patients. Thirty-three overlapping peptides
were synthesized spanning the entire B726P protein. These 33 peptides were tested in
ELISAs to evaluate which epitopes reacted with breast cancer sera. Reactive epitopes
were identified throughout the molecule and a total of 16/74 sera samples from breast
cancer patients had reactivity with one or more peptides.

Thirty-one overlapping synthetic peptides spanning the entire B726P
downstream ORF sequence (amino acid sequence set forth in SEQ ID NO:176) were
synthesized and 30 of these were tested in ELISA with sera from breast cancer patients
as well as control sera. The amino acid sequences of the 31 overlapping peptides of the
B726P downstream ORF are set forth in SEQ ID NO:594-624. Three additional
peptides of B726P, set forth in SEQ ID NO:625-627, were also tested. Several peptides
throughout the molecule showed reactivity, in particular peptide #2735 (amino acids
31-50; SEQ ID NO:597), peptide #2747 (amino acids 151-170; SEQ ID NO:609), peptide
#2750 (amino acids 181-200; SEQ ID NO:612), peptide #2753 (amino acids 211-230;
SEQ ID NO:615), and peptide #2766 (amino acids 231-250; SEQ ID NO:617). A total
of 16/74 breast cancer sera were reactive with at least one peptide.
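
The peptide coordinates quoted in this Example (1-20, 11-30, 31-50, and so on through 291-310 and 298-317) are consistent with 20-residue windows advanced 10 residues at a time, with a final window ending at the C-terminus. The following Python sketch generates such a tiling. The window size, step, and the protein length of 317 residues are inferred from the quoted coordinates rather than stated in the text, and the function name is hypothetical.

```python
# Illustrative sketch: tile a protein with overlapping peptide windows.
# Window size (20), step (10) and length (317) are inferred from the peptide
# coordinates quoted in the text, not stated there explicitly.
def overlapping_windows(
    length: int, size: int = 20, step: int = 10
) -> list[tuple[int, int]]:
    """Return 1-based (start, end) windows covering a protein of `length`."""
    windows = []
    start = 1
    while start + size - 1 <= length:
        windows.append((start, start + size - 1))
        start += step
    # Add a final window flush with the C-terminus if residues remain uncovered.
    if windows and windows[-1][1] < length:
        windows.append((length - size + 1, length))
    return windows


if __name__ == "__main__":
    peptides = overlapping_windows(317)
    print(len(peptides), peptides[0], peptides[-1])
```

Under these assumptions the tiling gives 31 windows, which matches the number of overlapping peptides described for the downstream ORF.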

B726P antibody epitopes were also mapped using rabbit antisera
generated against the B726P downstream ORF (SEQ ID NO:176). The epitopes
identified using the rabbit antisera were as follows: peptide #2732 (amino acids 1-20;
SEQ ID NO:594), peptide #2733 (amino acids 11-30; SEQ ID NO:595), peptide #2742
(amino acids 101-120; SEQ ID NO:604), peptide #2743 (amino acids 111-130; SEQ ID
NO:605), peptide #2744 (amino acids 121-140; SEQ ID NO:606), peptide #2745
(amino acids 130-151; SEQ ID NO:607), peptide #2751 (amino acids 191-210; SEQ ID
NO:613), and peptide #2753 (amino acids 211-230; SEQ ID NO:615). Some low level
reactivity was observed for peptide #2772 (amino acids 291-310; SEQ ID NO:623) and
peptide #2773 (amino acids 298-317; SEQ ID NO:624).

The above results confirm that B726P can be used alone or in
combination with other breast tumor antigens as a vaccine target. Additionally, these
results show that detection of antibodies to B726P can be used as a diagnostic indicator
of breast cancer either alone or in combination with detection of antibodies to other
antigens (e.g. Her-2/Neu or other antigens known to be expressed in breast cancer
tissue).



EXAMPLE 16

Immunohistochemical analysis of B726P expression in metastatic breast
Cancer

Affinity purified polyclonal antibodies anti-B726Pdown (generated
against the B726P downstream ORF) were used to assess B726P protein expression in
metastatic breast cancer samples.

In order to determine which tissues express the breast cancer antigen
protein B726P, immunohistochemistry (IHC) analysis was performed on a diverse range
of tissue sections. Tissue samples were fixed in formalin solution for 12-24 hrs and
embedded in paraffin before being sliced into 8 micron sections. Steam heat induced
epitope retrieval (SHIER) in 0.1 M sodium citrate buffer (pH 6.0) was used for optimal
staining conditions. Sections were incubated with 10% serum/PBS for 5 minutes.
Primary antibody (rabbit affinity purified anti-B726Pdown) was added to each section
for 25 minutes followed by a 25 minute incubation with anti-rabbit biotinylated antibody.
Endogenous peroxidase activity was blocked by three 1.5 minute incubations with
hydrogen peroxide. The avidin biotin complex/horse radish peroxidase (ABC/HRP)
system was used along with DAB chromogen to visualize antigen expression. Slides
were counterstained with hematoxylin to visualize cell nuclei.

Anti-B726Pdown immunoreactivity was observed in 7 of 10 metastatic
breast cancer samples analyzed but not observed in various normal tissues including
normal breast. Thus, these data confirm earlier microarray analysis (see Example 1)
showing that B726P is overexpressed in breast tumor tissue as compared to normal
tissue. Therefore, this antigen may be used in diagnostic and immunotherapeutic
applications for breast cancer.

EXAMPLE 17

Analysis of B726P Expression Using Immunoprecipitation and Western blot
Analysis

Affinity purified polyclonal antibodies generated against the B726P
downstream ORF protein set forth in SEQ ID NO:176 (anti-B726Pdown) were used to
assess the protein expression of the combined ORF of B726P in breast cancer cell lines
as compared to normal cells as described below. Since the combined ORF includes
both the upstream and downstream ORFs, the antibodies generated against the
downstream ORF crossreact with the combined ORF polypeptide as set forth in SEQ ID
NO:475.

Cells were lysed in 1% Triton lysis buffer on ice for 10 minutes. Lysates
were centrifuged at 15000 rpm and the supernatant was saved for immunoprecipitation
(IP)/Western analysis. 2 of anti-B726down polyclonal antibody was added to the
supernatant and rocked overnight at 4°C. 20 µl of protein G bead slurry was added and
incubated for 1 hour. Beads were then washed 3 times with 1 ml of lysis buffer. LDS
sample buffer and β-mercaptoethanol were added and the samples were heated for 5
min at 95°C. Samples were size fractionated by gel electrophoresis, transferred to
nitrocellulose and Western blotted with the mouse anti-B726down monoclonal antibody
A2.1.

35S-methionine labeling/IP analysis was carried out as follows: Cells
were grown in 10% Fetal Bovine Serum (FBS) media to desired density. Cells were
then starved with DMEM lacking methionine containing 0.1% FBS media for 10-15
minutes. FBS was added to a final concentration of 10% along with 35S-Methionine
translabel (300 µCi - 1 mCi). After incubating for 3-4 hours the cells were harvested,
washed, and lysed. B726P was immunoprecipitated as described above and samples
were size fractionated by gel electrophoresis before being exposed to autoradiography
film.

The results from the above described experiments showed that the full
length 148 kDa form (also called NYBR1), the 110 kDa combined ORF form, and the
35 kDa downstream ORF form are all expressed in breast tumor cell lines HTB21 and
BT474 but not in the SKBR3 normal breast cell line. Therefore, these results confirm
that these forms of the B726P protein are expressed in breast tumor cell lines and not in
normal cells.

WO 02/083956 PCT/US02/12378

U.S. patents, U.S. patent application publications, U.S. patent
applications, foreign patents, foreign patent applications and non-patent publications
referred to in this specification and/or listed in the Application Data Sheet, are
incorporated herein by reference in their entirety.

From the foregoing it will be appreciated that, although specific
embodiments of the invention have been described herein for purposes of illustration,
various modifications may be made without deviating from the spirit and scope of the
invention. Accordingly, the invention is not limited except as by the appended claims.

CLAIMS

What is Claimed:



1. A method for stimulating and/or expanding T cells specific for a 
tumor protein, comprising contacting T cells with at least one component selected from 
the group consisting of: 

(a) polypeptides comprising an amino acid sequence set forth in any
one of SEQ ID NOs:537, 534-536, and 538-547;

(b) sequences having at least 70% identity to an amino acid sequence 
set forth in SEQ ID NOs:537, 534-536, and 538-547; 

(c) sequences having at least 90% identity to an amino acid sequence 
set forth in SEQ ID NOs:537, 534-536, and 538-547; 

(d) polynucleotides comprising any one of the sequences set forth in
SEQ ID NOs:530, 520-529, and 531-533;

(e) sequences having at least 70% identity to a polynucleotide set 
forth in SEQ ID NOs:530, 520-529, and 531-533; 

(f) sequences having at least 90% identity to a polynucleotide set 
forth in SEQ ID NOs:530, 520-529, and 531-533; and 

(g) antigen-presenting cells that express a polynucleotide according
to (d), (e), or (f);

under conditions and for a time sufficient to permit the stimulation
and/or expansion of T cells.



2. An isolated T cell population, comprising T cells prepared
according to the method of claim 1.

3. A method for the treatment of breast cancer in a patient,
comprising the steps of:

(i) incubating CD4+ and/or CD8+ T cells isolated from a patient,
such that T cells proliferate, with at least one component selected from the group
consisting of:

(a) polypeptides comprising an amino acid sequence set forth 
in any one of SEQ ID NOs:537, 534-536, and 538-547; 

(b) sequences having at least 70% identity to an amino acid
sequence set forth in SEQ ID NOs:537, 534-536, and 538-547;

(c) sequences having at least 90% identity to an amino acid
sequence set forth in SEQ ID NOs:537, 534-536, and 538-547;

(d) polynucleotides comprising any one of the sequences set
forth in SEQ ID NOs:530, 520-529, and 531-533;

(e) sequences having at least 70% identity to a polynucleotide 
set forth in SEQ ID NOs:530, 520-529, and 531-533; 

(f) sequences having at least 90% identity to a polynucleotide 
set forth in SEQ ID NOs:530, 520-529, and 531-533; and 

(g) antigen-presenting cells that express a polynucleotide 

according to (d), (e), or (f); 

(ii) administering to the patient an effective amount of the proliferated T 

cells, and 

thereby inhibiting the development of a cancer in the patient.

4. An isolated polynucleotide comprising a sequence selected from 

the group consisting of: 

(a) sequences provided in SEQ ID NOs:530, 520-529, and 531-533; 

(b) complements of the sequences provided in SEQ ID NOs:530, 

520-529, and 531-533; 

(c) sequences consisting of at least 20 contiguous residues of a
sequence provided in SEQ ID NOs:530, 520-529, and 531-533;

(d) sequences that hybridize to a sequence provided in SEQ ID
NOs:530, 520-529, and 531-533, under highly stringent conditions;

(e) sequences having at least 75% identity to a sequence of SEQ ID 
NOs:530, 520-529, and 531-533; 

(f) sequences having at least 90% identity to a sequence of SEQ ID 
NOs:530, 520-529, and 531-533; and 

(g) degenerate variants of a sequence provided in SEQ ID NOs:530, 
520-529, and 531-533. 

5. An isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of: 

(a) sequences encoded by a polynucleotide of claim 4; and 

(b) sequences having at least 70% identity to a sequence encoded by 
a polynucleotide of claim 4; and 

(c) sequences having at least 90% identity to a sequence encoded by 

a polynucleotide of claim 4; 

(d) sequences set forth in SEQ ID NOs:537, 534-536, and 538-547;

(e) sequences having at least 70% identity to a sequence set forth in 
SEQ ID NOs:537, 534-536, and 538-547; and 

(f) sequences having at least 90% identity to a sequence set forth in 
SEQ ID NOs:537, 534-536, and 538-547. 

6. An expression vector comprising a polynucleotide of claim 4 
operably linked to an expression control sequence. 

7. A host cell transformed or transfected with an expression vector 
according to claim 6. 



8. A method for detecting the presence of a cancer in a patient,
comprising the steps of:

(a) obtaining a biological sample from the patient;

(b) contacting the biological sample with a binding agent that binds
to a polypeptide of claim 5;

(c) detecting in the sample an amount of polypeptide that binds to
the binding agent; and

(d) comparing the amount of polypeptide to a predetermined cut-off
value and therefrom determining the presence of a cancer in the patient.

9. A fusion protein comprising at least one polypeptide according to 

claim 5. 

10. A composition comprising a first component selected from the
group consisting of physiologically acceptable carriers and immunostimulants, and a
second component selected from the group consisting of:

(a) polypeptides according to claim 5;

(b) polynucleotides according to claim 4; 

(c) fusion proteins according to claim 9; 

(d) T cell populations according to claim 2; and 

(e) antigen presenting cells that express a polypeptide according to 

claim 7. 



11. A method for stimulating an immune response in a patient, 
comprising administering to the patient a composition of claim 10. 



12. A method for the treatment of a breast cancer in a patient, 
comprising administering to the patient a composition of claim 10. 
Fig. 1: SYN18C6 Northern blot (size markers: 2.37 kb, 1.35 kb, 0.24 kb)

Fig. 2: D207 B726P-specific CTL clone 1-9A recognizes HTB 21, a breast tumor
cell line that expresses HLA-B*1501 and B726P (targets shown: D207 B726P
fibroblast, B726P positive tumor, B726P negative tumor)
SEQUENCE LISTING

<110> Houghton, Raymond L.
Sleath, Paul R.
Persing, David H.
Jiang, Yuqiu
Dillon, Davin C.
Mitcham, Jennifer L.
Xu, Jiangchun
Harlocker, Susan L.
Hepler, William T.
Henderson, Robert A.
Fanger, Gary R.
Vedvick, Thomas S.
McNeill, Patricia D.
Durham, Margarita

<120> COMPOSITIONS AND METHODS FOR THE THERAPY
AND DIAGNOSIS OF BREAST CANCER

<130> 210121.47003PC

<140> PCT 

<141> 2002-04-15 

<160> 627 

<170> FastSEQ for Windows Version 4.0 



<210> 1
<211> 281
<212> DNA
<213> Homo sapien

<400> 1
caatgacagt caatctctat cgacagcctg cttcatattt agctattgtt cgtattgcct 60
tctgtcctag gaacagtcat atctcaagtt caaatgccac aacctgagaa gcggtgggct 120
aagataggtc ctactgcaaa ccacccctcc atatttccgt acgcaattac aattcagttt 180
ctgtgacatc tctttacacc actggaggaa aaatgagata ttctctgatt tattctacta 240
taacactcta catagagcta tggtgagtgc taaccacatc g 281

<210> 2
<211> 300
<212> DNA
<213> Homo sapien

<400> 2
gaggtcctgg gctaacctaa tggtttatta ttggtggaga gaaagatctg gaaatacttg 60
aggttattac atactagatt agcttctaat gtgaaccatt tttcttttaa cagtgataaa 120
ttattatttc cgaagttaac tgttcccttg gtcgtgatac acactcgatt aacaaacata 180
ctgttgtatt ttttccagtt ttgtttggct atgccaccac agtcatcccc agggtctata 240
catactatgt ctcaactgta ttatttgcca tttttggcat tagaatgctt cgggaaggct 300



<210> 3 
<211> 302 
<212> DNA

<213> Homo sapien 

<400> 3 

ggccgaggta attggttaag tctaaagaga ttattattcc ttgatgtttg ctttgtattg 60 

gctacaaatg tgcagaggta atacatatgt gatgtcgatg tctctgtctt tttttttgtc 120 

tttaaaaaat aattggcagc aactgtattt gaataaaatg atttcttagt atgattgtac 180 

agtaatgaat gaaagtggaa catgtttctt tttgaaaggg agagaattga ccatttattg 240 

ttgtgatgtt taagttataa cttatcgagc acttttagta gtgataactg tttttaaact 300 

tg 302 

<210> 4 

<211> 293 

<212> DNA 

<213> Homo sapien 

<400> 4 

tgtaccaatc ctttggcaca agaatatgta agaactatag ttgtttttat tggtttttgt 60 

tcttgagatt gttttcattc tgtttttgac tgtatctctt taggaggctg aggatggcat 120 

tattgcttat gatgactgtg gggtgaaact gactattgct tttcaagcca aggatgtgga 180 

aggatctact tctcctcaaa tacgagataa ggcaagataa ttctgctcat tcgagagagg 240 

gttaagagtt gtcatcttaa tcataaatcc tgcaggatgg gttcttcaaa ttt 293 

<210> 5 
<211> 275 
<212> DNA 
<213> Homo sapien 

<400> 5 

cgaggtttgg aatcagactt ctgtgtccag taaaaaactc ctgcactgaa gtcattgtga 60 

cttgagtagt tacagactga ttccagtgaa cttgatctaa tttcttttga tctaatgaat 120

gtgtctgctt accttgtctc cttttaattg ataagctcca agtagttgct aattttttga 180 

caactttaaa tgagtttcat tcacttcttt tacttaatgt tttaagtata gtaccaataa 240 

tttcattaac ctgttctcaa gtggtttagc tacca 275

<210> 6 

<211> 301 

<212> DNA 

<213> Homo sapien 

<400> 6 

gaggtctggt ttcctgggta tgcctggact gttgcccagt gtaagatctg tgcaagccat 60 

attggatgga agtttacggc caccaaaaaa gacatgtcac ctcaaaaatt ttggggctta 120 

acgcgatctg ctctgttgcc cacgatccca gacactgaag atgaaataag tccagacaaa 180 

gtaatacttt gcttgtaaac agatgtgata gagataaagt tatctaacaa attggttata 240 

ttctaagatc tgctttggaa attattgcct ctgatacata cctaagtaaa cataacatta 300 

a 301 

<210> 7 

<211> 301 

<212> DNA 

<213> Homo sapien 

<400> 7 

gtccagtttg tacacagtga ttccttatgc acgccgaaag ggtttccgta aaaatgacat 60 

tatatacaaa tctgtacacc catccaccag agcgattctc cagctcccag agggagttat 120 

caacttaaag caggatacct gaggtttcat gtctttagtt gccttatcat aatcccaaat 180 

atacatttca gggtttgttt ttgtttttaa agacactttc ctggaatatg tgcactatgg 240 
ttaaaattaa aaacaaaagt aataaaataa aatgatcgct ggaaggactg acctccccac 300
c 301 

<210> 8 

<211> 301 

<212> DNA 

<213> Homo sapien 



<400> 8 

ctgtcctcat ctctgcaaag ttcagcttcc ttccccaggt ctctgtgcac tctgtcttgg 60 

atgctctggg gagctcatgg gtggaggagt ctccaccaga gggaggctca ggggactggt 120 

tgggccaggg atgaatattt gagggataaa aattgtgtaa gagccaaaga attggtagta 180 

gggggagaac agagaggagc tgggctatgg gaaatgattt gaataatgga gctgggaata 240 

tggctggata tctggtacta aaaaagggtc tttaagaacc tacttcctaa tctcttcccc 300 

a 301 



<210> 9 

<211> 301 

<212> DNA 

<213> Homo sapien 



<400> 9 

gaggtctgcc taagtagagg acaaagactt cctcctttca aaggagaact gagcccagga 60 

ttggtaagtt taaggcactt aaccttgacc agctctgtag gtctggagca ttctggtccc 120

tggccgcttt caccaccagg cccttctcac ttatccacct cacatactgc cccagcattc 180 

ctttggcatt gcgagctgtg acttgacaca ttttaatgac aagattgaag tagctacctt 240 

gcaggataga ttttctgggg tataggggac aaaccaacag tgccatcagg tgtcttaaca 300 

c 301 



<210> 10 

<211> 301 

<212> DNA 

<213> Homo sapien 



<400> 10 

ggcaggtcca acagttcttc cagttctggt cgagctttga atcgtccctt gaagtcttct 60

tcagtgtgct ccttcactga cagtctgact ccttcaggaa gactgctttg gattatttcc 120 

aagaaaattt ctgcaaacgt agcactcaaa ccgctgatct gaaccactcg ctcatgggtg 180 

gtaagcactg agtccaggag cattttgctg ccttggtcct gcaactgcaa cacttctatg 240 

gttttggttg gcattgcata actttcctcg actttaatgg agagagattg cagaggttgt 300 

g 301 



<210> 11 

<211> 301 

<212> DNA 

<213> Homo sapien 



<400> 11 

aggtctgtga ctttcaccca ggacccagga cgcagccctc cgtgggcact gccggcgcct 60 

tgtctgcaca ctggaggtcc tccattacag aggcccagcg cacatcgctg gccccacaaa 120 

cgttcagggg tacagccatg gcagctcctt cctctgccgt gagaaaagtg cttggagtac 180 

ggtttgccac acacgtgact ggacagtgtc caattcaaat ctttcagggc agagtccgag 240 

cagcgcttgg tgacagcctg tcctctcctg ctctccaaag gccctgctcc ctgtcctctc 300 

t 301 



<210> 12 
<211> 301 
<212> DNA 
<213> Homo sapien



<400> 12 

gaggtctggg attacaggca cgtgccacca cacctagcta atttttgagc atggggctca 60 

aaggaactgc tctctggggc atgtcagatt tcggatttgg ggctgcacac tgatactctc 120 

taagtggtgg aggaacttca tcccactgaa attcctttgg catttggggt tttgtttttc 180 

tttttttcct tcttcatcct cctccttttt taaaagtcaa cgagagcctt cgctgactcc 240 

accgaagaag tgcaccactg ggagccaccc cagtgccagg cgcccgtcca gggacacaca 300 

c 301 



<210> 13 

<211> 256 

<212> DNA 

<213> Homo sapien 



<400> 13 

ttttttggca taaaaaacac aatgatttaa tttctaaagc acttatatta ttatggcatg 60 

gtttgggaaa caggttatta tattccacat aggtaattat gcagtgcttc tcatggaaaa 120 

aatgcttagg tattggcctt ttctctggaa accatatttt tcctttttta ataatcaact 180 

aaaatgtata tgttaaaaag cctcatcttt tgattttcaa tatacaaaat gctttcttta 240
aaagaacaag attcaa 256



<210> 14 

<211> 301 

<212> DNA 

<213> Homo sapien 



<400> 14 

ggtccttgat agaggaagag gaatatccaa ggcaaagcca ccaccacgtc caacctcctc 60 
atcctctacc tttcctgtcc ccagaggtat gagatagacc ccctggcctg gttcctgcac 120 
tgtgctaggc ccacagtgga cacttccacc ttaatggaga ataggcccca tggagtggag 180 
gtccctcctc catggcctgc aacccaatga ctatgggggt gacacaagtg acctctgccc 240 
tgtgatggct caacaccatc acacgcaact gtccagacaa gccccctcaa cgggctgctg 300
t 301



<210> 15
<211> 259
<212> DNA
<213> Homo sapien

<400> 15
gtcttgaaag tatttattgt ttaataattc tttctcccct cagccccatc cggccactct 60
ctctttctgc ttttctgatc atcctaaagg ctgaatacat cctcctcctg tgtggaggac 120
acgaagcaat actaaaatca atacactcga tcaggtcttc atcagatacc acgtcactgt 180
gggtagagtg ctaattttca acaaatgtgg tgttcttagg gccccacaag gtagtccttt 240
ctcaaggtcg ctgggccac 259

<210> 16
<211> 301
<212> DNA
<213> Homo sapien

<400> 16
cgaggttgtt cacattttca aataaataat actccccgta agtaataact gcaaccaatc 60
agtgttattc agtgctatgc ctccttgtaa tgggtagtta ttaattattt tcagagcttt 120
ctggaaatac tgtcctaact ggctatgttt aggatctttg ttatctctga agacaaagaa 180
agaactagga ctcttaattt tggggtgctt cttgactctt agttgggaaa ctgaaaatat 240
ttccaacctt ttacccacgt caatggcata ttctgggaat caccaccacc accaccacta 300
c 301

<210> 17 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 17 

gcccgggcag gtctggggcc tagggtggct ctttgcaaag ctgaggggca agctaaggaa 60 

gccaggcagg tcaggggccc tttcggcctt ctcaagcctc cacctgagtt ctcgtcaatg 120 

ccagtctccc tggtatgatt ggggacatta tcagagaaac atctaatagc gcacatctgg 180 

gcacccacac tctgcttcag ttgcatccat cctcccaccc caaattcaac tcctgaccca 240 

atacaaaaga cttttttaac caggatttct tcttgcagga aagctgactt ggaaacacgg 300 

g 301 

<210> 18 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 18 

attacaggca cgtgccacca cacctagcta atttttgagc atggggctca aaggaactgc 60 

tctctggggc atgtcagatt tcggatttgg ggctgcacac tgatactctc taagtggtgg 120 

aggaacttca tcccactgaa attcctttgg catttggggt tttgtttttc tttttttcct 180 

tcttcatcct cctccttttt taaaagtcaa cgagagcctt cgctgactcc accgaagaag 240 

tgcaccactg gggaccaccc agtgccaggc gcccgtccag ggacacacac agtcttcact 300 

g 301 

<210> 19 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 19 

agaatctctg cactgtcatc aggtacaaca aaagatcaaa cccctgtccc gatgttaact 60 

ttttaactta aaagaatgcc agaaaaccca gatcaacact ttccagctac gagccgtcca 120 

caaaggccac ccaaaggcca gtcagactcg tgcagatctt attttttaat agtagtaacc 180 

acaatacaca gctctttaaa gctgttcata ttcttccccc attaaacacc tgccccgggc 240 

ggccaagggc gaattctgca gatatccatc acactggcgg ccgctcgagc atgcatctag 300 

a 301 

<210> 20 
<211> 290 
<212> DNA 
<213> Homo sapien 

<400> 20 

aggttttttt tttttttttt tttttttttt tttttccctt tcaattcatt taatttcaac 60 

aatctgtcaa aaaacagcca ataaacaaat actgaattac attctgctgg gttttttaaa 120 

ggctctaaac tataaaaaca tcttgtgtct cccaccctga ccaccctgct acttttccat 180 

ataccacagg ccacccataa acacaaagcc agggggtgaa gctgacatgg tctatttgga 240 

gccagtaaac aggagggcga taagtcctga taagcactta tggacaatat 290 

<210> 21 

<211> 301 

<212> DNA 

<213> Homo sapien 
<400> 21

agaaaggtaa ctgccagcca ggcttgcatt gtttagccag aaattgctgc ttggttctag 60 

actctttaaa aaaaaaaaat acccagggtt tgtcatcatt ttcagaggca gagtgccaaa 120 

tatcacccaa agctcttgtg tctttttttt acccccttat tttattttta tttattaatt 180 

ttttgtgcaa acatcaaatg tcactggtgt tcacagaagg cttttttgac tagccttaaa 240 

ttcctgagtc aaaagattaa tcagattttc aggcagtgtt taatcaggtg ctttgtcctg 300 

t 301 

<210> 22 

<211> 301 

<212> DNA 

<213> Homo sapien 

<400> 22 

gacgccatgc accctccggt aaccagcagc cgcctgtcca tcccccaaga ccggaaaggc 60 

agcagcagcc cccgggagcc cagggctgtc ctcggtgcat ctggctgcag agggaaattg 120 

atgaccttac acagcaacta gcggccatgc agtccttcac tgacaagttc caggaccttt 180 

gaagttggag ccagcgtccg gagctgcagc caagcgagtt tcctccttat cctccttagc 240 

cagggctttt tctcttccgc tgcatttgcc cccttcccaa cgcaattcaa agcagttgtg 300 

a 301 

<210> 23 

<211> 381 

<212> DNA 

<213> Homo sapien 

<400> 23 

cgaggtccag acagtggacc aagagatacg ctacataaat tggggtttca caattcttac 60 

attatttgtc tgtcacagaa gagagctgct tatgattttg aaggggtcag ggagggtggg 120 

agttggtaaa gagtagggta tttctataac agatattatt cagtcttatt tcctaagatt 180 

ttgttgtaac ttaaggtatc ttgctacagt agacagaatt ggtaatagca acttttaaaa 240 

ttgtcattag ttctgcaata ttagctgaaa tgtagtacag aaaagaatgt acatttagac 300 

atttgggttc agttgcttgt agtctgtaaa tttaaaacag cttaatttgg tacaggttac 360 

acatatggac ctcccgggcg g 381 

<210> 24 

<211> 214 

<212> DNA 

<213> Homo sapien 

<400> 24

aatgatgtaa aaattaatca acagggctgc cacttgcgaa tcccctccaa ggatgctgtg 60 

caaagggtct cattggtcct gatgaataat cttgtgactg tacatattcc tgggtgcatg 120 

tccacaaata ctgaggtata gcctgcatgc cactaaaaat aacaaaggtt tcaggggtgg 180 

aaacattgtc caccacactg tcatgaccat cttt 214 

<210> 25
<211> 302
<212> DNA
<213> Homo sapien

<400> 25
gggggcactg agaactccct ctggaattct tggggggtgt tggggagaga ctgtgggcct 60
ggagataaaa cttgtctcct ctaccaccac cctgtaccct agcctgcacc tgtcctcatc 120
tctgcaaagt tcagcttcct tccccaggtc tctgtgcact ctgtcttgga tgctctgggg 180
agctcatggg tggaggagtc tccaccagag ggaggctcag gggactggtt gggccaggga 240
tgaatatttg agggataaaa attgtgtaag aagccaaaga aattggtagt aggggggaga 300
ac 302
<210> 26
<211> 301
<212> DNA 
<213> Homo sapien 

<400> 26 

ttggagaacg cgctgacata ctgctcggcc acagtcagtg aagctgctgc atctccatta 60 

tgttgtgtca gagctgcagc caggattcga atagcttcag ctttagcctt ggccttcgcc 120 

agaactgcac tggcctctcc tgctgcctga tttatctgtg cagccttttc tgcttcggag 180 

gccaggatct gggcctgttt cttcccttct gccacattga tggccgactc tcgggtcccc 240 

tcagactcta gaactgtggc ccgtttccgc cgctctgcct ccacctgcat ctgcatagac 300 

t 301 

<210> 27 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 27 

aaatcagtca tcacatctgt gaaaagagtg ctagttataa caaatgagat cacaaatttg 60 

accattttat tagacaccct ctattagtgt taacagacaa agatgaaggt taagttgaaa 120 

tcaaattgaa atcatcttcc ctctgtacag attgcaatat ctgataatac cctcaacttt 180 

cttggtgcaa attaattgcc tggtactcac agtccagtgt taacaggcaa taatggtgtg 240 

attccagagg agaggactag gtggcaggaa aataaatgag attagcagta tttgacttgg 300 

a 301 

<210> 28 

<211> 286 

<212> DNA 

<213> Homo sapien 

<400> 28 

tttttttttg cacaggatgc acttattcta ttcattctcc cccacccttc ccatatttac 60 

atccttagag gaagagaggg gtaaggtgat aaagtaactg aaggaccgca agacgggtat 120 

gtcccttgtt caccaaatgg tcaaagggtc aaagatcgga ggaggtcagg gggtaacgca 180 

ggaacaggtg agggcgtttc gccctctctc cctctcccct tttcaacctc ttaatcactg 240 

gctaactcgc gacctcatgg gttaattcgt aagcttacac gcgttg 286 

<210> 29 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 29 

gtcatgttct tgctcttcct tctttacaca tttgagttgt gccttctgtt cttaaagaga 60 

ttttcctttg ttcaaaggat ttattcctac catttcacaa atccgaaaat aattgaggaa 120 

acaggttaca tcattccaat tttgccttgg gtttgaagag tctctcatgg tggcacagtc 180 

ctccagggta gctatgttgt tgggctcccc tacatcccag aagctcagag actttgtcaa 240 

aggtgtgccg tccacccatt gccactgacc ctcgacaacc tggtctgaca gtccaataaa 300 

a 301 

<210> 30 

<211> 332 

<212> DNA 

<213> Homo sapien 



<400> 30 
gagcagaatt gatgcctatg gctccaagtc aaatactgct aatctcattt attttcctgc 60

cacctagtcc tctcctctgg aatcacacca ttattgcctg ttaacactgg actgtgagta 120 

ccaggcaatt aatttgcacc aagaaagttg agggtattat cagatattgc aatctgtaca 180 

gagggaagat gatttcaatt tgatttcaac ttaaccttca tctttgtctg ttaacactaa 240 

tagagggtgt ctaataaaat ggtcaaattt gtgatctcat ttgttataac tagcactctt 300

ttcacagatg tgatgactga tttccagcag ac 332 

<210> 31 

<211> 141 

<212> DNA 

<213> Homo sapien 

<400> 31 

aaaggctatc aagtactttg aaggacagga aggaatgaac acacccaggt ggacgtttgg 60 

tttcatttgc aggggttcag ggagggttgc aggggttcag ggagggctct tgtcccacaa 120 

ccgggggaag ggagagggca c 141 

<210> 32 

<211> 201 

<212> DNA 

<213> Homo sapien 

<400> 32 

gagctgatct cacagcacat acagaatgat gctactatgt agaccctcac tcccttggga 60 

aatctgtcat ctaccttaaa gagagaaaaa agatggaaca taggcccacc tagtttcatc 120 

catccaccta cataaccaac atagatgtga ggtccactgc actgatagcc agactgcctg 180 

gggtaaacct tttcagggag g 201

<210> 33 

<211> 181 

<212> DNA 

<213> Homo sapien 

<400> 33 

tttcaaaaca ctcatatgtt gcaaaaaaca catagaaaaa taaagtttgg tgggggtgct 60 

gactaaactt caagtcacag acttttatgt gacagattgg agcagggttt gttatgcatg 120 

tagagaaccc aaactaattt attaaacagg atagaaacag gctgtctggg tgaaatggtt 180 

c 181 

<210> 34 

<211> 151 

<212> DNA 

<213> Homo sapien 

<400> 34 

atgtcctgca cagtatagct tggacctctg ggcctgaacc agggtgagca tcaaggcccc 60 

catttctcct caccacgggg tcgcttgtca gctccaagaa ccagtctggc cccactgaga 120 

acttttcagt cgagggcctg atgaatcttg g 151 

<210> 35 

<211> 291 

<212> DNA 

<213> Homo sapien 

<400> 35 

tctttagggc aaaatcatgt ttctgtgtac ctagcaatgt gttcccattt tattaagaaa 60 

agctttaaca cgtgtaatct gcagtcctta acagtggcgt aattgtacgt acctgttgtg 120 

tttcagtttg tttttcacct ataatgaatt gtaaaaacaa acatacttgt ggggtctgat 180 
agcaaacata gaaatgatgt atattgtttt ttgttatcta tttattttca tcaatacagt 240
attttgatgt attgcaaaaa tagataataa tttatataac aggttttctg t 291 

<210> 36 

<211> 201 

<212> DNA 

<213> Homo sapien 

<400> 36 

ctgatacaat tataataacg gttccctgaa ccttttagag tgcaattaag aacaaaaact 60 

aaattttgtt tacatgaata tggaataaat acaataatca aaatatgact ctccctaaaa 120 

gtgaaacaca caagccaatc cggaactgct gtgcgaaaga taaaatcgag aaaggcaagg 180 

tttcggtagg aggacgcgat g 201 

<210> 37 

<211> 121 

<212> DNA 

<213> Homo sapien 

<400> 37 

catcacactg gcggccgctc gagcatgcat ctagagggcc caattcgccc tataatgagt 60 

cgtattacaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 120 

c 121 

<210> 38 

<211> 200 

<212> DNA 

<213> Homo sapien 

<400> 38 

aaacatgtat tactctatat ccccaagtcc tagagcatga cctgcatgtt ggagatgttg 60 

tacagcaatg tatttatcca gacatacata tatgatattt agagacacag tgattctttt 120 

gataacacca cacatagaac attataatta cacacaaatt tatggtaaaa gaattaatat 180 

gctgtctggt gctgctgtta 200 

<210> 39 
<211> 760 
<212> DNA
<213> Homo sapien 

<400> 39 

gcgtggtcgt cggccgaggt cctgggctag acctaatggt ttattattgg tggagagaaa 60 

gatctggaaa tacttgaggt tattacatac tagattagct tctaatgtga accatttttc 120 

ttttaacagt gatcaaatta ttatttcgaa gttaatcgtt cccttggtgg ctgcatacac 180 

atcgcattaa caaacatact gttgtatttt ttcccagttt tgtttggcta tgccaccaca 240 

gtcatcccca gggtctatac atactatgtt tcaactgtat tatttgccat ttttggcatt 300 

agaatgcttc gggaaggctt aaagatgagc cctgatgagg gtcaagagga actggaagaa 360 

gttcaagctg aattaaagaa gaaagatgaa gaagtaagcc atggcactgt tgatctggac 420 

caaaaaggca ctcaactagg aataaacact ctacagaggt ttctcagtgg ccccatctgt 480 

gtgatatgcg gggctacaca aaaatagctt cttttgcttt gttctgttct tatacctgtc 540 

tgtgatctga cttggggttg gtgtgaatgt agtagagaaa ggaagctgac agatgaatac 600 

tgaacacagg taatcagttt ccttaattag gttgattata agctcctgaa aagcaggaac 660 

tgtattttat aattttacct gtttctcccg tggtgtctag gatagtaagt gagcagagca 720 

gtaaatactg tttggtttgt tcagacctgc ccgggcggcc 760 



<210> 40
<211> 452
<212> DNA
<213> Homo sapien

<400> 40
aatcactaaa gatattgact agagaatgct gtgtgctatt tcaattacat ttgtttttct 60
tttattaaca ggaattttga ttcttcaagg aagtggctca atttcaattt caaataacca 120
ggtttatcgt gacttttcct tcttgtttac ttttcgctag gaaggggagt tgtaggggca 180
gattcaggta ttggaatagg aaaattacgt ctaaaccatg gaaatcttgg aaatggaatt 240
ggtggaagtg ggcgaaatgg atatgggtaa gggaacacaa aaaaccctga agctaattca 300
tcgctgtcac tgatacttct tttttctcgt tcctggtctt gagagactgg gaaaccaaca 360
gccactgcca agatggctgt gatcaggagg agaactttct tcatctcaaa cgtttcagtc 420
agttctttct ctcacctcgg ccgcgaccac gc 452

<210> 41
<211> 676
<212> DNA
<213> Homo sapien

<400> 41
aatctttgaa tgccaagtct cttctgtact ttcttttatt aacatcatag tctttgcatc 60
aagatacata gcaatgatag caggtttctt tttaaagctt agtattaata ttaaatattt 120
ttccccattt aaattttaca ttacttgcca agaaaaaaaa aaaattaaaa ctcaagttac 180
ttgaagcctg gacacacttc catgattagc cgggctaggt aaaagttggt ggctttattc 240
ttcctgctct ataagcagat ccaggcccta gaaagatggg accagggtat ataattgttt 300
ttgaaaagtg tgctacaaaa atggatggcc tgttataagc caggatacaa agttaaggat 360
gggggtaagg gagggacatt ttcttccaga agaaaagaca gaatttctga agagtcccag 420
tccataattt tcccaaaatg gttggaggag agggtaaaat ctcaacatga gtttcaaagt 480
actgtctctg tgaggggccg gtagatgcct tgctgaggag ggatggctaa tttggaccat 540
gccccatccc cagctaggag aatggaaatg gaaactttaa ttgcccagtg ggtgtgaaag 600
tgggctgaag cttggttggt actgaattct ctaagaggtt tcttctagaa acagacaact 660
cagacctgcc cgggcg 676

<210> 42 
<211> 468 
<212> DNA 
<213> Homo sapien 

<400> 42 

agcgtggtcg cggccgaggt ttggccggga gcctgatcac ctgccctgct gagtcccagg 60 

ctgagcctca gtctccctcc cttggggcct atgcagaggt ccacaacaca cagatttgag 120 

ctcagccctg gtgggcagag aggtagggat ggggctgtgg ggatagtgag gcatcgcaat 180 

gtaagactcg ggattagtac acacttgttg attaatggaa atgtttacag atccccaagc 240 

ctggcaaggg aatttcttca actccctgcc ccccagccct ccttatcaaa ggacaccatt 300 

ttggcaagct ctatgaccaa ggagccaaac atcctacaag acacagtgac catactaatt 360 

aaaaccccct gcaaagccca gcttgaaacc ttcacttagg aacgtaatcg tgtcccctat 420 
cctacttccc cttcctaatt ccacagacct gcccgggcgg ccgctcga 468

<210> 43 
<211> 408 
<212> DNA 
<213> Homo sapien 

<400> 43 

atcatatcaa aacactatct tcccatctgt ttctcaatgc ctgctacttc ttgtagatat 60 

ttcatttcag gagagcagca gttaaacccg tggattttgt agttaggaac ctgggttcaa 120 

acctctttcc actaattggc tatgtctctg gacagttttt tttttttttt ttttttttaa 180 

accctttctg aactttcact ttctatggct acctcaaaga attgttgtga ggcttgagat 240 

aatgcatttg taaagggtct gccagatagg aagatgctag ttatggattt acaaggttgt 300 

taaggctgta agagtctaaa acctacagtg aatcacaatg catttacccc cactgacttg 360 
gacataagtg aaaactagcc cgaagtctct ttttcaaatt acttacag 408

<210> 44 

<211> 160 

<212> DNA 

<213> Homo sapien 



<400> 44 

tggtcgcggc cgaggtcttg tgtgccctgt ggtccagggg accaagaaca acaagatcca 60 

ctctctgtgc tacaatgatt gcaccttctc acgcaacact ccaaccagga ctttcaacta 120 

caacttctcc gctttggcaa acaccgtcac tcttgctgga 160 



<210> 45 

<211> 231 

<212> DNA 

<213> Homo sapien 



<400> 45 

cgagcggccg cccgggcagg tctggggagg tgattccatc cagagtcata tctgttgtca 60 

ccccaataag tcgatcagca aggctgacag gctgtgagga aaccccggcc ttgtagcctg 120 

tcacctctgg ggggatgatg actgcctggc agacgtaggc tgtgatagat ttgggagaaa 180 

acctgactca ccctcaggaa tccggaggtc ggtgacattg tcggtgcaca c 231 

<210> 46 

<211> 371 

<212> DNA 

<213> Homo sapien 



<400> 46 

cccgggcagg tctgtgtaac atgccaaggc tttgcacttt ctgcagagca gttttttatt 60 

ttccttatca ggtacaggtt ttggtttttc ttgactatct ctgatgaatt tttcatgagt 120 

ctgtatatgc agaatctttt ccctaaatac tgcttcgtcc catgtctgaa ggcgtaaaat 180 

aaagtcattc atcatttttt ctttgtacat gtttatttgt tctttttcaa ttacaccaag 240 

cattactagt cagaaggaag cacttgctac ctcttgctct tcctctgcct ctggtttgga 300 

tcattttgat gacattgccc acattactca tgaaggatga caagattgca ctgtgcaatg 360 

tcaattgcct t 371 



<210> 47 
<211> 261 
<212> DNA 
<213> Homo sapien



<400> 47 

gccctgtttt tatacacttc acatttgcag aaatataatg atgccctcat tatcagtgag 60 

catgcacgaa tgaaagatgc tctggattac ttgaaagact tcttcagcaa tgtccgagca 120 

gcaggattcg atgagattga gcaagatctt actcagagat ttgaagaaaa gctgcaggaa 180 

ctagaaagtg tttccaggga tcccagcaat gagaatccta aacttgaaga cctctgcttc 240 

atcttacaag aagagtacca c 261 

<210> 48 

<211> 701 

<212> DNA 

<213> Homo sapien 



<400> 48 

cgagcggccc ccgggcaggt ccaattagta caagtctcat gatataatca ctgcctgcat 60 

acatatgcac agatccagtt agtgagtttg tcaagcttaa tctaattggt taagtctcaa 120 

agagattatt attcttgatg tttgctttgt attggctaac aaatgtgcag aggtaataca 180 
tatgtgatgt ccgatgtctc tgtctttttt tttgtcttta aaaaataatt ggcagcaact 240

gtatttgaat aaaatgattt cttagtatga ttgtaccgta atgaatgaaa gtggaacatg 300

tttctttttg aaagggagag aattgaccat ttattattgt gatgtttaag ttataactta 360

ttgagcactt ttagtagtga taactgtttt taaacttgcc taataccttt cttgggtatt 420

gtttgtaatg tgacttattt aacccccttt tttgtttgtt taagttgctg ctttaggtta 480

acagcgtgtt ttagaagatt taaatttttt tcctgtctgc acaattagtt attcagagca 540

agagggcctg attttataga agccccttga aaagaggtcc agatgagagc agagatacag 600

tgagaaatta tgtgatctgt gtgttgtggg aagagaattt tcaatatgta actacggagc 660

tgtagtgcca ttagaaactg tgaatttcca aataaatttg a 701



<210> 49 

<211> 270 

<212> DNA 

<213> Homo sapien 

<400> 49 

agcggccgcc cgggcaggtc tgatattagt agctttgcaa ccctgataga gtaaataaat 60 

tttatgggcg ggtgccaaat actgctgtga atctatttgt atagtatcca tgaatgaatt 120 

tatggaaata gatatttgtg cagctcaatt tatgcagaga ttaaatgaca tcataatact 180 

ggatgaaaac ttgcatagaa ttctgattaa atagtgggtc tgtttcacat gtgcagtttg 240 

aagtatttaa attaaccact cctttcacag 270 

<210> 50 

<211> 271 

<212> DNA 

<213> Homo sapien 

<400> 50 

atgcatttat ccatatgaac ttgattattc tgaattactg actataaaaa ggctattgtg 60 

aaagatatca cactttgaaa cagcaaatga attttcaatt ttacatttaa ttataagacc 120 

acaataaaaa gttgaacatg cgcatatcta tgcatttcac agaagattag taaaactgat 180 

ggcaacttca gaattatttc atgaagggta caaacagtct ttaccacaat tttcccatgg 240 

tcttatcctt caaaataaaa ttccacacac t 271 

<210> 51 

<211> 241 

<212> DNA 

<213> Homo sapien 

<400> 51 

tggtcgcggc cgaggtgtga ggagatgaac tttgtgttaa tggggggcac tttaaatcga 60 

aatggcttat ccccaccgcc atgtaagtta ccatgcctgt ctcctccctc ctacacattt 120 

ccagctcctg ctgcagttat tcctacagaa gctgccattt accagccctc tgtgattttg 180 

aatccacgag cactgcaggc cctccacagc gttactaccc agcaggcact cagctcttca 240 

t 241 

<210> 52 

<211> 271

<212> DNA 

<213> Homo sapien 

<400> 52 

tccaagactt aaaacttagg aaacacctat gatgccactt taactggaag taatggagac 60 

atctgattcc aaattcacat tttaaatgcc tatttgcaat cagcaaagag ccaggtatgc 120 

tgcatgctgc ttgctgtaag ttacgatttg gcttcactag ctcaaatttt ttcactccac 180 

caaaagataa ggcacaggcc cgtttgtcca atcaagtttg ctgaaaatac tgcagcctga 240 

gtgtagacaa acttcccctg aatttgctag a 271 
<210> 53

<211> 493 

<212> DNA 

<213> Homo sapien 



<400> 53 

ttagcgtggt cgcggtccga ggtctggcct gactagctca ctctgaagag tgtctttcac 60 

atggattaac caaaaaatgc attactgcct ttggcacact gtcttgaata ttctttctga 120

caatgagaaa atatgattta atggagtcgt tcaataacct cacaatctcg ctgttccgag 180 

cagatagttt tcgtgccaac aggaactggc acatctagca ggttcacggc atgacctttt 240 

tgtggactgg ctggcataat tggaatgggt tttgattttt cttctgctaa taactcttca 300 

agcttttgaa gttttcaagc attcctctcc agttgcctgt ggttggttct tgaacaccat 360 

ctccaacccc accacctcca gatgcaacct tgtctcgtga tacagacctg cccgggcggc 420 

cctcaagggc gaattctgca gatatccatc acactggcgg ccgctcgagc atgcatctag 480 

agggcccaat tcg 493

<210> 54 

<211> 321 

<212> DNA 

<213> Homo sapien 



<400> 54 

cgtggtcgcg gccgaggtct gtttgcttgt tggtgtgagt ttttcttctg gagactttgt 60 

actgaatgtc aataaactct gtgattttgt taggaagtaa aactgggatc tatttagcca 120 

ctggtaagct tctgaggtga aggattcagg gacatctcgt ggaacaaaca ctccccactg 180 

gactttctct ctggagatac ccttttgaat atacaatggc cttggctcac taggtttaaa 240 

tacaaacaag tctgaaaccc actgaagact gagagattgc agcaatattc tctgaattag 300

gatcgggttc cataactcta a 321 

<210> 55 

<211> 281 

<212> DNA 

<213> Homo sapien 



<400> 55 

ttgcaaatga aactgtggat gtataataag aaaacacaag ggtttattct taacactaaa 60 

attaacatgc cacacgaaga ctgcattaca gctctctgtt tctgtaatgc agaaaaatct 120 

gaacagccca ccttggttac agctagcaaa gatggttact tcaaagtatg gatattaaca 180 

gatgactctg acatatacaa aaaagctgtt ggctggacct gtgactttgt tggtagttat 240 

cacaagtatc aagcaactaa ctgttgtttc tccgaagatg g 281 

<210> 56 

<211> 612 

<212> DNA 

<213> Homo sapien 



<400> 56 

gcgtggtcgc ggccgaggtc ctgtccgggg gcactgagaa ctccctctgg aattcttggg 60 

gggtgttggg gagagactgt gggcctggag ataaaacttg tctcctctac caccaccctg 120 

taccctagcc tgcacctgtc ctcatctctg caaagttcag cttccttccc caggtctctg 180 

tgccactctg tcttggatgc tctggggagc tcatgggtgg aggagtctcc accagaggga 240 

ggctcagggg actggttggg ccagggatga atatttgagg gataaaaatt gtgtaagagc 300 

caaagaattg gtagtagggg gagaacagag aggagctggg ctatgggaaa tgatttgaat 360 

aatggagctg ggaatatggc tggatatctg gtactaaaaa agggtcttta agaacctact 420 

tcctaatctc ttccccaatc caaaccatag ctgtctgtcc agtgctctct tcctgcctcc 480 

agctctgccc caggctcctc ctagactctg tccctgggct agggcagggg aggagggaga 540 

gcagggttgg gggagaggct gaggagagtg tgacatgtgg ggagaggacc agacctgccc 600 

gggcggccgt cg 612

<210> 57

<211> 363 

<212> DNA 

<213> Homo sapien 

<400> 57 

gtcgcggccg aggtcctgag cgtcacccta gttctgcccc tttttagctg tgtagacttg 60 
gacaagacat ttgacttccc tttctccttg tctataaaat gtggacagtg gacgtctgtc 120 
acccaagaga gttgtgggag acaagatcac agctatgagc acctcgcacg gtgtccagga 180 
tgcacagcac aatccatgat gcgttttctc cccttacgca ctttgaaacc catgctagaa 240 
aagtgaatac atctgactgt gctccactcc aacctccagc gtggatgtcc ctgtctgggc 300 
cctttttctg ttttttattc tatgttcagc accactggca ccaaatacat tttaattcac 360 

363 

<210> 58 

<211> 750 

<212> DNA 

<213> Homo sapien 

<400> 58 

cgtggtcgcg gccgaggtct aattccacct gactggcaga acctgcgccc ctcgcctaac 60 

ctgcgccctt ctcccaactc gcgtgcctca cagaacccag gtgctgcaca gccccgagat 120 

gtggcccttc ttcaggaaag agcaaataag ttggtccaag tacttgatgc ttaaggaata 180 

cacaaaggtg cccatcaagc gctcagaaat gctgagagat atcatccgtg aatacactga 240 

tgtttatcca gaaatcattg aacgtgcatg ctttgtccta gagaagaaat ttgggattca 300 

actgaaagaa attgacaaag aagaacacct gtatattctc atcagtaccc ccgagtccct 360 

ggctggcata ctgggaacga ccaaagacac acccaagctc ggtctcttct tggtgattct 420 

gggtgtcatc ttcatgaatg gcaaccgtgc cagtgaggct gtcttttggg aggcactacg 480 

caagatggga ctgcgtcctg gggtgagaca tcccctccct tggagatcta aggaaacttc 540 

tcacctatga gtttgtaaag cagaaatacc tggactacag acgagtgccc aacagcaacc 600 

ccccggagta tgagttcctc tggggcctcc gtccctacca tgagactagc aagatgaaaa 660 

tgctgagatt cattgcagag gttcagaaaa gagaccctcg tgactggact gcacagttca 720 

tggaggctgc agatgaggac ctgcccgggc 750 

<210> 59 

<211> 505 

<212> DNA 

<213> Homo sapien 

<400> 59 

tggccgcccg ggcaggtcca gtctacaagc agagcactct catggggagc accagatgag 60 

ttccagccgc agttctttta taagctttaa gtgcctcatg aagacgcgag gatctcttcc 120 

aagtgcaacc tggtcacatc agggcacatt cagcagcaga agtctgtttc cagtatagtc 180 

cttggtatgg ctaaattcca ctgtcccttt ctcagcagtc aataatccat gataaattct 240 

gtacaacact gtagtcaata acagcagcac cagacagcat attaattctt ttaccataaa 300 

tttgtgtgta attataatgt tctatgtgtg gtgttatcaa aagaatcact gtgtctctaa 360 

atatcatata tgtatgtctg gataaataca ttgctgtaca acatctccaa catgcaggtc 420 

atgctctaag acttggggat atagagtaat acatgtttcg tggacctcgg ccgcgaccac 480 

gctaagggcg aattctgcag atatc 505

<210> 60 

<211> 520 

<212> DNA 

<213> Homo sapien 



<400> 60 

cgtggtcgcg gccgaggtcc tcaggacaag gaaacaggta tcagcatgat ggtagcagaa 60

accttatcac caaggtgcag gagctgactt cttccaaaga gttgtggttc cgggcagcgg 120

tcattgcctg cccttgctgg agggctgatt ttagtgttgc ttattatgtt ggccctgagg 180 

atgcttcgaa gtgaaaataa gaggctgcag gatcagcggc aacagatgct ctcccgtttg 240 

cactacagct ttcacggaca ccattccaaa aaggggcagg ttgcaaagtt agacttggaa 300 

tgcatggtgc cggtcagtgg gcacgagaac tgctgtctga cctgtgataa aatgagacaa 360 

gcagacctca gcaacgataa gatcctctcg cttgttcact ggggcatgta cagtgggcac 420 

gggaagctgg aattcgtatg acggagtctt atctgaacta cacttactga acagcttgaa 480 

ggacctgccc gggcggccgc tcgaaagggg cgaattctgc 520 

<210> 61
<211> 447
<212> DNA
<213> Homo sapien

<400> 61
agagaggtgt ttttattctt tggggacaaa gccgggttct gtgggtgtag gattctccag 60
gttctccagg ctgtagggcc cagaggctta atcagaattt tcagacaaaa ctggaacctt 120
tcttttttcc cgttggttta tttgtagtcc ttgggcaaac caatgtcttt gttcgaaaga 180
gggaaaataa tccaaacgtt tttcttttaa cttttttttt aggttcaggg gcacatgtgt 240
aggcttgcta tataggtaaa ttgcatgtca ccagggtttg ttgtacagat tatttcatca 300
tccagataaa aagcatagta ccagataggt agttttttga tcctcaccct ccttccatgc 360
tccgacctca ggtaggcccc agtgtctgac ctgcccggcg gcccgctcga aagggccaat 420
tctgcagata tccatcacac tggccgg 447

<210> 62
<211> 83
<212> PRT
<213> Homo sapien

<400> 62
Lys Lys Val Leu Leu Leu Ile Thr Ala Ile Leu Ala Val Ala Val Gly
1               5                   10                  15
Phe Pro Val Ser Gln Asp Gln Glu Arg Glu Lys Arg Ser Ile Ser Asp
            20                  25                  30
Ser Asp Glu Leu Ala Ser Gly Phe Phe Val Phe Pro Tyr Pro Tyr Pro
        35                  40                  45
Phe Arg Pro Leu Pro Pro Ile Pro Phe Pro Arg Phe Pro Trp Phe Arg
    50                  55                  60
Arg Asn Phe Pro Ile Pro Ile Pro Ser Ala Pro Thr Thr Pro Leu Pro
65                  70                  75                  80
Ser Glu Lys




<210> 63 

<211> 683 

<212> DNA 

<213> Homo sapien 

<400> 63 

acaaagattg gtagctttta tattttttta aaaatgctat actaagagaa aaaacaaaag 60 
accacaacaa tattccaaat tataggttga gagaatgtga ctatgaagaa agtattctaa 120 
ccaactaaaa aaaatattga aaccactttt gattgaagca aaatgaataa tgctagattt 180
aaaaacagtg tgaaatcaca ctttggtctg taaacatatt tagctttgct tttcattcag 240
atgtatacat aaacttattt aaaatgtcat ttaagtgaac cattccaagg cataataaaa 300

aaagwggtag caaatgaaaa ttaaagcatt tattttggta gttcttcaat aatgatrcga 360 

gaaactgaat tccatccagt agaagcatct ccttttgggt aatctgaaca agtrccaacc 420 

cagatagcaa catccactaa tccagcacca attccttcac aaagtccttc cacagaagaa 480 

gtgcgatgaa tattaattgt tgaattcatt tcagggcttc cttggtccaa ataaattata 540 
gcttcaatgg gaagaggtcc tgaacattca gctccattga atgtgaaata ccaacgctga 600
cagcatgcat ttctgcattt tagccgaagt gagccactga acaaaactct tagagcacta 660 
tttgaacgca tctttgtaaa tgt 683 

<210> 64 
<211> 749 
<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 534 

<223> n = A,T,C or G 



<400> 64 

ctgttcattt gtccgccagc tcctggactg gatgtgtgaa aggcatcaca tttccatttt 60
cctccgtgta aatgttttat gtgttcgcct actgatccca ttcgttgctt ctattgtaaa 120
tatttgtcat ttgtatttat tatctctgtg ttttccccct aaggcataaa atggtttact 180
gtgttcattt gaacccattt actgatctct gttgtatatt tttcatgcca ctgctttgtt 240
ttctcctcag aagtcgggta gatagcattt ctatcccatc cctcacgtta ttggaagcat 300
gcaacagtat ttattgctca gggtcttctg cttaaaactg aggaaggtcc acattcctgc 360
aagcattgat -bgagacattt gcacaatcta aaatgtaagc aaagtaagtc attaaaaata 420
caccctctac ttgggcttta tactgcatac aaatttactc atgagccttc ctttgaggaa 480
ggatgtggat ctccaaataa agatttagtg tttattttga gctctgcatc ttancaagat 540
gatctgaaca cctctccttt gtatcaataa atagccctgt tattctgaag tgagaggacc 600
aagtatagta aaatgctgac atctaaaact aaataaatag aaaacaccag gccagaacta 660
tagtcatact cacacaaagg gagaaattta aactcgaacc aagcaaaagg cttcacggaa 720
atagcatgga aaaacaatgc ttccagtgg 749

<210> 65
<211> 612
<212> DNA
<213> Homo sapiens

<400> 65
acagcagcag tagatggctg caacaacctt cctcctaccc cagcccagaa aatatttctg 60
ccccacccca ggatccggga ccaaaataaa gagcaagcag gcccccttca ctgaggtgct 120
gggtagggct cagtgccaca ttactgtgct ttgagaaaga ggaaggggat ttgtttggca 180
ctttaaaaat agaggagtaa gcaggactgg agaggccaga gaagatacca aaattggcag 240
ggagagacca tttggcgcca gtcccctagg agatgggagg agggagatag gtatgagggt 300
aggcgctaag aagagtagga ggggtccact ccaagtggca gggtgctgaa atgggctagg 360
accaacagga cactgactct aggtttatga cctgtccata cccgttccac agcagctggg 420
tgggagaaat caccattttg tgacttctaa taaaataatg ggtctaggca acagttttca 480
atggatgcta aaacgattag gtgaaaagtt gatggagaat tttaattcag gggaattagg 540
ctgataccat ctgaaaccat ttggcatcat taaaaatgtg acaacctggt ggctgccagg 600
gaggaagggg ag 612



<210> 66 
<211> 703 
<212> DNA 

<213> Homo sapiens 



<400> 66 

tagcgtggtc gcggccgagg tacattgatg ggctggagag cagggttggc agcctgttct 60 
gcacagaacc aagaattaca gaaaaaagtc caggagctgg agaggcacaa catctccttg 120 
gtagctcagc tccgccagct gcagacgcta attgctcaaa cttccaacaa agctgcccag 180 
accagcactt gtgttttgat tcttcttttt tccctggctc tcatcatcct gcccagcttc 240 
agtccattcc agagtcgacc agaagctggg tctgaggatt accagcctca cggagtgact 300 
tccagaaata tcctgaccca caaggacgta acagaaaatc tggagaccca agtggtagag 360 
tccagactga gggagccacc tggagccaag gatgcaaatg gctcaacaag gacactgctt 420 
gagaagatgg gagggaagcc aagacccagt gggcgcatcc ggtccgtgct gcatgcagat 480 
gagatgtgag ctggaacaga ccttcctggc ccacttcctg atcacaagga atcctgggct 540 
tccttatggc tttgcttccc actgggattc ctacttaggt gtctgccctc aggggtccaa 600 
atcacttcag gacaccccaa gagatgtcct ttagtctctg cctgaggcct agtctgcatt 660 
tgtttgcata tatgagaggg tacctgcccg ggcggccgct cga 703 

<210> 67 
<211> 1022 
<212> DNA 
<213> Homo sapiens 



<400> 67 

cttgagaaag caggattgtt ttaagttcca agatttaaca aacttactgt tcagcatcat 60 
attcaagcct aaaaggaaga taggattttc aagatatatt tccaacttct ttaacatggc 120 
accatggatg aactgtttct cagcactgtg ctgcttcact tggaattaag gatgaattgg 180 
gaggagacag tatgacatag gtgggtaggt tgggtggtga ggggaaccag ttctaatagt 240 
cctcaactcc actccagctg ttcctgttcc acacggtcca ctgagctggc ccagtccctt 300 
tcactcagtg tgtcaccaaa ggcagcttca aggctcaatg gcaagagacc acctataacc 360 
tcttcacctt ctgctgcctc tttctgctgc cactgactgc catggccatc tgctatagcc 420 
gcattgtcct cagtgtgtcc aggccccaga caaggaaggg gagccatggt gagactccaa 480 
ttcccaggcc ttaatcctta accctagacc tgttgcctct agcatcattt atttatctac 540 
ctacctaata gctatctacc agtcattaaa ccatggtgag attctaacca tgtctagcac 600 
ctgatgctag agataatttt gttgaatccc ttcaattata aacagctgag ttagctggac 660 
aaggactagg gaggcaatca gtattattta ttcttgaaca ccatcaagtc tagacttggt 720 
ggcttcatat ttctatcata atccctgggg gtaagaaatc atatagcccc aggttgggaa 780 
ggggaaaacg gtttgcaaca ttctcctcct tgtaggaggc gagctctgtc tcactagcta 840 
tgcccctcca tcaattcacc ctatactcag atcagaagct gagtgtctga attacagtat 900 
attttctaaa ttcctagccc ctgctggtga atttgccctc ccccgctcct ttgacaattg 960 
tccccgtgtt cgtctccggg ccctgagact ggccctgctt atcttgctga ccttcatcct 1020 
ct 1022 



<210> 68 

<211> 449 

<212> DNA 

<213> Homo sapiens 



<400> 68 

ccagatccat tttcagtggt ctggatttct ttttattttc ttttcaactt gaaagaaact 60 
ggacattagg ccactatgtg ttgttactgc cactagtgtt caagtgcctc ttgttttccc 120 
agagatttcc tgggtctgcc agaggcccag acaggctcac tcaagctctt taactgaaaa 180 
gcaacaagcc actccaggac aaggttcaaa atggttacaa cagcctctac ctgtcgcccc 240 
agggagaaag gggtagtgat acaagtctca tagccagaga tggttttcca ctccttctag 300 
atattcccaa aaagaggctg agacaggagg ttattttcaa ttttattttg gaattaaata 360 
cttttttccc tttattactg ttgtagtccc tcacttggat atacctctgt tttcacgata 420 
gaaataaggg aggtctagag cttctattc 449 



<210> 69 

<211> 387 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 22, 26, 36, 45, 54, 56, 62, 63, 73, 92, 98, 105, 155, 174, 
194, 302, 312, 358, 375, 378, 381 
<223> n = A,T,C or G 
<400> 69 

gcccttagcg tgggtcgcgg cncgangtct ggagcntatg tgatncctat ggtncncagg 60 
cnnatactgc tantctcatt tattctcctg cnacctantc ctctnctctg gaatcacacc 120 
attattgcct gttaacactg gactgtgagt accangcaat taatttgcac caanaaagtt 180 
gagggtatta tcanatattg caatctgtac agagggaaga tgatttcaat ttgatttcaa 240 
cttaaccttc atctttgtct gttaacacta atagagggtg tctaataaaa tggcaaattt 300 
gngatctcat tnggtataac tacactcttt ttcacagatg tgatgactga atttccanca 360 
acctgcccgg gcggncgntc naagggc 387 

<210> 70 
<211> 836 
<212> DNA 
<213> Homo sapiens 



<400> 70 

tattccattt acaaaataaa ttcagccctg cactttcttt agatgccttg atttccagaa 60 
tggagcttag tgctactgaa taccctggcc acagagccac ctcaggatat tcttttctcc 120 
accctagttt atttatttat agatatctgt ttacaaagtc tgtagtaaat cctgatgctg 180 
accatctgaa atgtactttt tttctgaatg ctgtttcaat ctaaaatagc agcttttgag 240 
aaaacaatga tgtaaattcc ttatgataaa aggatgattc tatatattct ttaatgatat 300 
taaatatgcc gaagccaagc acacagtctt tctaaagtgt gtgtatgttt gtgtgaatgt 360 
gaatgatact gatcttatat ctgttaaaag ttgttttaaa aagctgtggc atcccattgt 420 
tcatatttgc caagtcttct gtaaagatgt ctaggacgaa atattttatg tgctaatgca 480 
tgtatttgta aaccagattt gtttaccact caaaattaac ttgttttctt catccaaaaa 540 
agtttatttc ttccacgtac ttaaattttc tgtgtgggta taatatagct ttctaatttt 600 
tttctttcac aaaggcaggt tcaaaattct gttgaaagaa aaatgctttc tgaaactgag 660 
gtataacacc agagcttgct gtttaaagga ttatatgatg tacatcagtt ctataaatgt 720 
gctcagcagt ttaacatgtg aatcctgttt taaagtgctc agatttcaac tgtgtaagcc 780 
attgatataa cgctgtaatt aaaaatgttt atatgaaaaa aaaaaaaaaa aaaaaa 836 

<210> 71 
<211> 618 
<212> DNA 
<213> Homo sapiens 



<400> 71 

gttgcagtga gctcaagtgt tgggtgtatc agctcaaaac accatgtgat gccaatcatc 60 
tccacaggag caatttgttt accttttttt tctgatgctt tactaacttc atcttttaga 120 
tttaaatcat tagtagatcc tagaggagcc agtttcagaa aatatagatt ctagttcagc 180 
accacccgta gttgtgcatt gaaataatta tcattatgat tatgtatcag agcttctggt 240 
tttctcattc tttattcatt tattcaacaa ccacgtgaca aacactggaa ttacaggatg 300 
aagatgagat aatccgctcc ttggcagtgt tatactatta tataacctga aaaaacaaac 360 
aggtaatttt cacacaaagt aatagatatc atgacacatt taaaataggg cactactgga 420 
acacacagat aggacatcca ggttttgggt caatattgta gactttttgg tggatgagat 480 
atgcaggttg atrccagaag gacaacaaaa acatatgtca gatagaaggg aggagcaaat 540 
gccaagagct ggagctgagg aagatcactg tgaaattcta tgtagtctag ttggctggat 600 
gctagagcaa agaggtgg 618 

<210> 72 
<211> 806 
<212> DNA 
<213> Homo sapiens 



<400> 72 

tctacgatgg ccatttgctc attgtctttc ctctgtgtgt agtgagtgac cctggcagtg 60 
tttgcctgct cagagtggcc cctcagaaca acagggctgg ccttggaaaa accccaaaac 120 
aggactgtgg tgacaactct ggtcaggtgt gatttgacat gagggccgga ggcggttgct 180 
gacggcagga ctggagaggc tgcgtgcccg gcactggcag cgaggctcgt gtgtccccca 240 
ggcagatctg ggcactttcc caacccaggt ttatgccgtc tccagggaag cctcggtgcc 300 
agagtggtgg gcagatctga ccatccccac agaccagaaa caaggaattt ctgggattac 360 
ccagtccccc ttcaacccag ttgatgtaac cacctcattt tttacaaata cagaatctat 420 
tctactcagg ctatgggcct cgtcctcact cagttattgc gagtgttgct gtccgcatgc 480 
tccgggcccc acgtggctcc tgtgctctag atcatggtga ctcccccgcc ctgtggttgg 540 
aatcgatgcc acggattgca ggccaaattt cagatcgtgt ttccaaacac ccttgctgtg 600 
ccctttaatg ggattgaaag cacttttacc acatggagaa atatattttt aatttgtgat 660 
gcttttctac aaggtccact atttctgagt ttaatgtgtt tccaacactt aaggagactc 720 
taatgaaagc tgatgaattt tcttttctgt ccaaacaagt aaaataaaaa taaaagtcta 780 
tttagatgtt gaaaaaaaaa aaaaaa 806 

<210> 73 
<211> 301 
<212> DNA 
<213> Homo sapiens 
<220> 
<221> misc_feature 
<222> 59 
<223> n = A,T,C or G 



<400> 73 

actctggtaa gcttgttgtt gtccaagtga agctccctca gatgaggcgt gttggccana 60 
gagccattgt caacagcaga gatgctgttg aaactcaatc ccaacttagc caaattattc 120 
agtcctttca ggctagctgc atcaactctg ctgattttgt tgccatcaag atgtaattcc 180 
gtaagggaag gaggaagacc ttgaggaatg ctggygatat tggyatcagc aatgcggatg 240 
tasgaagagc ttcttcmttc cctggaaagc cccattttca atyccttgag ctcttcakcg 300 
g 301 

<210> 74 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<400> 74 

agtttacatg atccctgtaa cagccatggt ctcaaactca gatgcttcct ccatctgcca 60 
agtgtgttct ggatacagag cacatcgtgg cttctggggt cacactcagc ttaggctgtg 120 
ggtccacaga gcactcatct ggctgggcta tggtggtggt ggctctactc aagaagcaaa 180 
gcagttacca gcacattcaa acagtgtatt gaacatcttt taaatatcaa agtgagaaac 240 
aagaaggcaa cataataatg ttatcagaaa gatgttagga agtaaggaca gctgtgtaaa 300 
gcttgaggct gaaaagtagc ttgccagctt catttctttg gtttcttggg tagtgggccg 360 
ccggaacagc aagatgtgag gttctggttc atggatcata t 401 

<210> 75 
<211> 612 
<212> DNA 
<213> Homo sapiens 



<400> 75 

ttatttttca atttttattt tggttttctt acaaaggttg acattttcca taacaggtgt 60 
aagagtgttg aaaaaaaaat tcaaattttt ggggagcgag ggaaggagtt aatgaaactg 120 
tattgcacaa tgctctgatc aatccttctt tttctctttt gcccacaatt taagcaagta 180 
gatgtgcaga agaaatggaa ggattcagct ttcagttaaa aaagaagaag aagaaatggc 240 
aaagagaaag ttttttcaaa tttctttctt ttttaattta gattgagttc atttatttga 300 
aacagactgg gccaatgtcc acaaagaatt cctggtcagc accaccgatg tccaaaggtg 360 
caatatcaag gaagggcagg cgtgatggct tatttgtttt gtattcaatg attgtctttc 420 
cccattcatt tgtcttttta gagcagccat ctacaagaac agtgtaagtg aacctgctgt 480 
tgccctcagc aacaagttca acatcattag agccctgtag aatgacagcc tttttcaggt 540 
tgccagtctc ctcatccatg tatgcaatgc tgttcttgca gtggtaggtg atgttctgag 600 
aggcatagtt gg 612 

<210> 76 

<211> 844 

<212> DNA 

<213> Homo sapiens 



<400> 76 

ggctttcgag cggccgcccg ggcaggtctg atggttctcg taaaaacccc gctagaaact 60 
gcagagacct gaaattctgc catcctgaac tcaagagtgg agaatactgg gttgacccta 120 
accaaggatg caaattggat gctatcaagg tattctgtaa tatggaaact ggggaaacat 180 
gcataagtgc caatcctttg aatgttccac ggaaacactg gtggacagat tctagtgctg 240 
agaagaaaca cgtttggttt ggagagtcca tggatggtgg ttttcagttt agctacggca 300 
atcctgaact tcctgaagat gtccttgatg tgcagcykgc attccttcga cttctctcca 360 
gccgagcttc ccagaacatc acatatcact gcaaaaatag cattgcatac atggatcagg 420 
ccagtggaaa tgtaaagaag gccctgaagc tgatggggtc aaatgaaggt gaattcaagg 480 
ctgaaggaaa tagcaaattc acctacacag ttctggagga tggttgcacg aaacacactg 540 
gggaatggag caaaacagtc tttgaatatc gaacacgcaa tgctgttcct tgacattgca 600 
ccaccaatgt ccagaggtgc aatgtcaagg aacggcaggc gagatggctt atttgttttg 660 
tattcaatga ttgtcttgcc ccattcattt gtctttttgg agcagccatc gactaggaca 720 
gagtaggtga acctgctgtt gccctcagca acaagttcca catcgttgga accctgcaga 780 
agcacagcct tgttcaarct gcccgtctcc tcatccagat acctcggccg cgaccacgct 840 
aatc 844 



<210> 77 

<211> 314 

<212> DNA 

<213> Homo sapiens 



<400> 77 

ccagtcctcc acttggcctg atgagagtgg ggagtggcaa gggacgtttc tcctgcaata 60 
gacacttaga tttctctctt gtgggaagaa accacctgtc catccactga ctcttctaca 120 
ttgatgtgga aattgctgct gctaccacca cctcctgaag aggcttccct gatgccaatg 180 
ccagccatcc tggcatcctg gccctcgagc aggctgcggt aagtagcgat ctcctgctcc 240 
agccgtgtct ttatgtcaag cagcatcttg tactcctggt tctgagcctc catctcgcat 300 
cggagctcac tcag 314 

<210> 78 
<211> 548 
<212> DNA 
<213> Homo sapiens 



<400> 78 

accaagagcc aagtgttaca caggatattt taaaaataaa atgtttttgg aatcctcacc 60 
tcccatgcta tcttctaaga taactacaaa tattcttcaa agatttaact gagttctgcc 120 
aaggacctcc caggactcta tccagaatga ttattgtaaa gctttacaaa tcccaccttg 180 
gccctagcga taattaggaa atcacaggca aacctcctct ctcggagacc aatgaccagg 240 
ccaatcagtc tgcacattgg ttttgttaga tactttgtgg agaaaaacaa aggctcgtga 300 
tagtgcagct ctgtgcctac agagagcctc ccttttggtt ctgaaattgc tgatgtgaca 360 
gagacaaagc tgctatgggt ctaaaacctt caataaagta actaatgaca ctcaaggtcc 420 
tgggactctg agacagacgg tggtaaaacc cacagctgcg attcacattt ccaatttatt 480 
ttgagctctt tctgaagctg ttgcttccta cctgagaatt cccatttaga gagctgcaca 540 
gcacagtc 548 



<210> 79 
<211> 646 
<212> DNA 

<213> Homo sapiens 



<400> 79 

accccgtcac tatgtgaata aaggcagcta gaaaatggac tcaattctgc aagccttcat 60 
ggcaacagcc catattaaga cttctagaac aagttaaaaa aaatcttcca tttccatcca 120 
tgcatgggaa aagggcttta gtatagttta ggatggatgt gtgtataata ataaaatgat 180 
aagatatgca tagtggggga ataaagcctc agagtccttc cagtatgggg aatccattgt 240 
atcttagaac cgagggattt gtttagattg ttgatctact aatttttttc ttcacttata 300 
tttgaatttt caatgatagg acttattgga aattggggat aattctgttg tggtattaaa 360 
taatattcat tttttaaaaa ctcatcttgg tattgagtta gtgcattgac ttccaatgaa 420 
ttgacataag cccatatttc attttaacca gaaacaaaaa ctagaaaatg ttactcccta 480 
aataggcaac aatgtatttt ataagcactg cagagattta gtaaaaaaca tgtatagtta 540 
ctttagaaac aacttctgac acttgagggt tacccaatgg tctccttccc attctttata 600 
tgaggtaaat gcaaaccagg gagccaccga ataaacagcc ctgagt 646 

<210> 80 
<211> 276 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 16, 29, 32, 45, 53, 55, 58, 59, 65, 66, 75, 77, 85, 90, 97, 
109, 112, 163, 170 
<223> n = A,T,C or G 



<400> 80 

gtctgaatga gcttcnctgc gagatgganc ancataaccc agaantccaa aancntanng 60 
aacgnnaaaa cccgntngaa caagnaaacn gcaactnacg gccgcctgnt gnagggcgag 120 
gacgcccacc tctcctcctc ccagttctcc tctggatcgc agncatccan agatgtgacc 180 
tcttccagcc gccaaatccg caccaaggtc atggatgtgc acgatggcaa ggtgggtgtc 240 
cacccacgaa caggtccttc gcaccaagaa ctgagg 276 

<210> 81 
<211> 647 
<212> DNA 
<213> Homo sapiens 



<400> 81 

gtcctgcctt tcatcttttc tttaaaaaaa ataaatgttt acaaaacatt tccctcagat 60 
tttaaaattc atggaagtaa taaacagtaa taaaatatgg atactatgaa aactgacaca 120 
cagaaaaaca taaccataaa atattgttcc aggatacaga tattaattaa gagtgacttc 180 
gttagcaaca cgtagacatt catacatatc cggtggaaga ctggtttctg agatgcgatt 240 
gccatccaaa cgcaaatgct tgatcttgga gtaggrtaat ggccccagga tcttgcagaa 300 
gctctttatg tcaaacttct caagttgatt gacctccagg taatagtttt caaggttttc 360 
attgacagtt ggtatgtttt taagcttgtt ataggacaga tccagctcaa ccagggatga 420 
cacattgaaa gaatttccag gtattccact atcagccagt tcgttgtgag ataaacgcag 480 
atactgcaat gcattaaaac gcttgaaata ctcatcaggg atgttgctga tcttattgtt 540 
gtctaagtag agagttagaa gagagacagg gagaccagaa ggcagtctgg ctatctgatt 600 
gaagctcaag tcaaggtatt cgagtgattt aagaccttta aaagcag 647 

<210> 82 
<211> 878 
<212> DNA 
<213> Homo sapiens 



<400> 82 

ccttctttcc ccactcaatt cttcctgccc tgttattaat taagatatct tcagcttgta 60 
gtcagacaca atcagaatya cagaaaaatc ctgcctaagg caaagaaata taagacaaga 120 
ctatgatatc aatgaatgtg ggttaagtaa tagatttcca gctaaattgg tctaaaaaag 180 
aatattaagt gtggacagac ctatttcaaa ggagcttaat tgatctcact tgttttagtt 240 
ctgatccagg gagatcaccc ctctaattat ttctgaactt ggttaataaa agtttataag 300 
atttttatga agcagccact gtatgatatt ttaagcaaat atgttattta aaatattgat 360 
ccttcccttg gaccaccttc atgttagttg ggtattataa ataagagata caaccatgaa 420 
tatattatgt ttatacaaaa tcaatctgaa cacaattcat aaagatttct cttttatacc 480 
ttcctcactg gccccctcca cctgcccata gtcaccaaat tctgttttaa atcaatgacc 540 
taagatcaac aatgaagtat tttataaatg tatttatgct gctagactgt gggtcaaatg 600 
tttccatttt caaattattt agaattctta tgagtttaaa atttgtaaat ttctaaatcc 660 
aatcatgtaa aatgaaactg ttgctccatt ggagtagtct cccacctaaa tatcaagatg 720 
gctatatgct aaaaagagaa aatatggtca agtctaaaat ggctaattgt cctatgatgc 780 
tattatcata gactaatgac atttatcttc aaaacaccaa attgtcttta gaaaaattaa 840 
tgtgattaca ggtagagaac ctcggccgcg accacgct 878 

<210> 83 
<211> 645 
<212> DNA 
<213> Homo sapiens 



<400> 83 

acaaacattt tacaaaaaag aacattacca atatcagtgg cagtaagggc aagctgaaga 60 
ataaatagac tgagtttccg ggcaatgtct gtcctcaaag acatccaaac tgcgttcagg 120 
cagctgaaac aggcttcttt cccagtgaca agcatatgtg gtcagtaata caaacgatgg 180 
taaatgaggc tactacatag gcccagttaa caaactcctc ttctcctcgg gtaggccatg 240 
atacaagtgg aactcatcaa ataatttaaa cccaaggcga taacaacgct atttcccatc 300 
taaactcatt taagccttca caatgtcgca atggattcag ttacttgcaa acgatcccgg 360 
gttgtcatac agatacttgt ttttacacat aacgctgtgc catcccttcc ttcactgccc 420 
cagtcaggtt tcctgttgtt ggaccgaaag gggatacatt ttagaaatgc ttccctcaag 480 
acagaagtga gaaagaaagg agaccctgag gccaggatct attaaacctg gtgtgtgcgc 540 
aaaagggagg gggaaggcag gaatttgaaa ggataaacgt ctcctttgcg ccgaggaatc 600 
aggaagcgtg actcacttgg gtctgggacg ataccgaaat ccggt 645 

<210> 84 
<211> 301 
<212> DNA 
<213> Homo sapiens 
<220> 
<221> misc_feature 
<222> 270, 284 
<223> n = A,T,C or G 



<400> 84 

tctgatgtca atcacaactt gaaggatgcc aatgatgtac caatccaatg tgaaatctct 60 
cctcttatct cctatgctgg agaaggatta gaaggttatg tggcagataa agaattccat 120 
gcacctctaa tcatcgatga gaatggagtt catgggctgg tgaaaaatgg tatttgaacc 180 
agataccaag ttttgtttgc cacgatagga atagctttta tttttgatag accaactgtg 240 
aacctacaag acgtcttgga caactgaagn ttaaatatcc acangggttt attttgcttg 300 
g 301 

<210> 85 
<211> 296 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> misc_feature 
<222> 16, 20, 240 
<223> n = A,T,C or G 



<400> 85 

agcgtgggtc gcggcncgan gtagagaacc gactgaaacg tttgagatga agaaagttct 60 
cctcctgatc acagccatct tggcagtggc tgttggtttc ccagtctctc aagaccagga 120 
acgagaaaaa agaagtatca gtgacagcga tgaattagct tcagggtttt ttgtgttccc 180 
ttacccatat ccatttcgcc cacttccacc aattccattt ccaagatttc catggtttan 240 
acgtaatttt cctattccaa tacctgaatc tgcccctaca actccccttc ctagcg 296 

<210> 86 
<211> 806 
<212> DNA 
<213> Homo sapiens 



<400> 86 

tctacgatgg ccatttgctc attgtctttc ctctgtgtgt agtgagtgac cctggcagtg 60 
tttgcctgct cagagtggcc cctcagaaca acagggctgg ccttggaaaa accccaaaac 120 
aggactgtgg tgacaactct ggtcaggtgt gatttgacat gagggccgga ggcggttgct 180 
gacggcagga ctggagaggc tgcgtgcccg gcactggcag cgaggctcgt gtgtccccca 240 
ggcagatctg ggcactttcc caacccaggt ttatgccgtc tccagggaag cctcggtgcc 300 
agagtggtgg gcagatctga ccatccccac agaccagaaa caaggaattt ctgggattac 360 
ccagtccccc ttcaacccag ttgatgtaac cacctcattt tttacaaata cagaatctat 420 
tctactcagg ctatgggcct cgtcctcact cagttattgc gagtgttgct gtccgcatgc 480 
tccgggcccc acgtggctcc tgtgctctag atcatggtga ctcccccgcc ctgtggttgg 540 
aatcgatgcc acggattgca ggccaaattt cagatcgtgt ttccaaacac ccttgctgtg 600 
ccctttaatg ggattgaaag cacttttacc acatggagaa atatattttt aatttgtgat 660 
gcttttctac aaggtccact atttctgagt ttaatgtgtt tccaacactt aaggagactc 720 
taatgaaagc tgatgaattt tcttttctgt ccaaacaagt aaaataaaaa taaaagtcta 780 
tttagatgtt gaaaaaaaaa aaaaaa 806 

<210> 87 
<211> 620 
<212> DNA 
<213> Homo sapiens 



<400> 87 

tttttgcatc agatctgaaa tgtctgagag taatagtttc tgttgaattt ttttttgttc 60 
atttttctgc acagtccatt ctgtttttat tactatctag gcttgaaata tatagtttga 120 
aattatgaca tccttcctct ttgttatttt cctcatgatt gctttggcta ttcaaagttt 180 
attttagttt catgtaaatt tttgaattgt attttccatt attgtgaaaa tagtaccact 240 
gcaattttaa taggaagttt attgaatcta tagattactt tggataatat ggcacttcaa 300 
taatattcat gttttcaatt catagacaaa atattttaaa atttatttgt atcttttcta 360 
atttttcctt tttttattgt aaagatttac ctccttggtt aatattttcc tcagaaattt 420 
attatttaag gtatagtcaa taaaattttc ttcctctatt ttgtcagata gtttaagtgt 480 
atgaaaccat agatatactt gtatgttaat tttatatttt gctaatttac tgagtgtatt 540 
tattagttta gagaggtttt aatgtactgt ttatggtttt ttaaatataa gattacttat 600 
tttttaaaaa aaaaaaaaaa 620 

<210> 88 
<211> 308 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> misc_feature 
<222> 9, 189, 194, 206, 238, 296 
<223> n = A,T,C or G 
<400> 88 

tagctgtgnt cagcaggccg aggttttttt tttttttgag atggagtctc gccctgtcac 60 
ccaggctgga gtgcagtggc ctgatctcag ctcactgcaa gctccacctc ctggattcac 120 
gctattctcc tgcctcagcc tcccaagtag ctgggactac aggcgcccgc caccacgccc 180 
agctaattnt ttgnattttt agtacnagat gcggtttcat cgtgttagcc agcatggnct 240 
cgatctcctg acctcgtgaa ctgcccgcct cggcctccca aagacctgcc cgggcnggcc 300 
gctcgaaa 308 

<210> 89 
<211> 492 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 448 

<223> n = A,T,C or G 
<400> 89 

agcggccgcc cgggcaggtc tgttaagtaa catacatatc accttaataa aaatcaagat 60 
gaaatgtttt agaaactatt ttatcaaaag tggctctgat acaaagactt gtacatgatt 120 
gttcacagca gcactattaa tgccaaaaag tagacaaaac ctaaatgtcc attaactgat 180 
aagcaaaatg tggtatatcc atacaatgga atattatgta gcccacaaca tggcatggag 240 
tactacaaca tggatgagcc tcaaaaacgt tatgctaaat gaaaaaagtc agatatagga 300 
aaccacatgt catatgatcc catttatatg aaatagccag aaaaggcaag tcatagaaac 360 
aagatagatc ggaaaatggg ttggaggact acaaatggca ccagggatct ttgaagttga 420 
tggaaatggt ctaaaatcag actgtggntg tggttgaaca agtctgtaaa tttaccaaaa 480 
tgcgttaata ca 492 

<210> 90 
<211> 390 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 106, 184, 206, 209, 234, 314 
<223> n = A,T,C or G 

<400> 90 

tcgagcggcc gcccgggcag gtacaagctt tttttttttt tttttttttt ttttctaaca 60 
gttctctgtt ttattgcaat acagcaaagt ctggttaata ttaagngata tcaacataaa 120 
gtattggtga ggagtctttt gtgacatttt ttaccatccc accttaaata tttctgtgca 180 
aaanaatcca catcattgtt tggtancana ggatctctta aaaagttccc taanacactg 240 
agggcataaa accaaacaaa ataaaataag gagtgatagg ctaaagcagt atcttcccct 300 
ccatccacat ttgncaagca ttatattcta accaaaaaat gatcacacca ggccatgcaa 360 
aactgtccaa tattaccgag aaaaaaccct 390 

<210> 91 
<211> 192 
<212> DNA 

<213> Homo sapiens 
<400> 91 

agcgtggtcg cggccgaggt ctgtcaatta atgctagtcc tcaggattta aaaaataatc 60 
ttaactcaaa gtccaatgca aaaacattaa gttggtaatt actcttgatc ttgaattact 120 
tccgttacga aagtccttca catttttcaa actaagctac tatatttaag gcctgcccgg 180 
gcggccgctc ga 192 

<210> 92 

<211> 570 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 519, 559 
<223> n = A,T,C or G 



<400> 92 

agcgtggtcg cggccgaggt ctgacaacta acaaagaagc aaaaactggc atcttggaca 60 
tcctagtatt acacttgcaa gcaattagaa cacaaggagg gccaaggaaa aagtttagct 120 
ttgaatcact tccaaatcta ctgattttga ggttccgcag tagttctaac aaaacttttc 180 
agacaatgtt aactttcgat taagaaagaa aaaaacccca aacatcttca ggaattccat 240 
gccaggttca gtctcttcca gtgagcccgc ttgctaaaag tccacgtgca ccattaatta 300 
gctgggctgg cagcaccatg taaaaagaag cctattcacc accaaccaca cagactagac 360 
atgtaaagta ggatcaagta atggatgaca accatggtcg tggaatatgg tcaatgagag 420 
tcagaaaagt acaggcacca gtacaagcag cagataacag aattgacggg ccaaaggata 480 
aaaataggct tatttaaata ggatgctaca gaacacatnc acttctaatt ggaagctgct 540 
ttacactggg tggcattgna ccatatgcat 570 



<210> 93 

<211> 446 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 328, 389 
<223> n = A,T,C or G 



<400> 93 

tcgagcggcc gcccgggcag gtccaggttt ttatttagtt gtgtaatctt ggacaagtta 60 
cctaactttt ttgagtctga atatatttaa tctgcaaaat gagaatcatg ataatacgtc 120 
ataggcttaa ttaggaggat taaatgaaat aatttatagg tggtgccatg gttacataca 180 
agtattagta gttaattctt ttcctttgtt tacttttata gtataggttg gatgaaggtt 240 
ccagtatagg caaaaatact acttgggggt aaagtagagt gtgatacttt atttgaaatg 300 
ttccctgaat ctgatcttta ctttttgnta ctgctgcact acccaaatcc aaattttcat 360 
cccaacattc ttggatttgt gggacagcng tagcagcttt tccaatataa tctatactac 420 
atcttttctt actttggtgc tttttg 446 



<210> 94 

<211> 409 

<212> DNA 

<213> Homo sapiens 



<400> 94 

cgagcggccg cccgggcagg tccatcagct cttctgctta gaatacgagg cagacagtgg 60 
agaggtcaca tcagttatcg tctatcaggg tgatgaccca agaaaggtga gtgagaaggt 120 
gtcggcacac acgcctctgg atccacccat gcgagaagcc ctcaagttgc gtatccagga 180 
ggagattgca aagcgccaga gccaacactg accatgttga aggcgttctc tccaggctgg 240 
attcactgca ctcggaagaa ttctgcccag ggaatttagt gtgggggtac caggaccagt 300 
ttgtcttgat cttgagaccc ccagagctgc tgcatccata gggtgttgca ggactacacc 360 
tggcctgcct tgcagtcatt ctttcttata tgttgaccca tttgcccaa 409 

<210> 95 

<211> 490 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 486 
<223> n = A,T,C or G 



<400> 95 

tcgagcggcc gcccgggcag gtcctacttg tttgcagctt ccacacactg cacctaccta 60 
ctacctctct tccatgctta actgggttta gaaaggtgag ctatgcgtag aagaactact 120 
tgggatattc aagtgctgta tttgaacgat aagcctatag ataacagtct gaagctgcaa 180 
gggagacttt gttagtacac tactataaac aggtaaacta cctgtttgta cttgatatag 240 
tgcatatgaa atgactgatt taatacaaaa ctacagaaca tgcaaaattt tttctgagat 300 
gttaagtatt acttcagtgg agaacaaaac ttacttaacc tttcgctaat gcatgtagta 360 
ccagaaagca aacatggttt tagcttcctt tactcaaaat atgaacatta agtggttgtg 420 
aattttgtct gccaagtggt tcagaaaata cattataaat aacctaagtt aaaaaaaaga 480 
aactgngaac 490 

<210> 96 
<211> 223 
<212> DNA 
<213> Homo sapiens 



<400> 96 

agcgtggtcg cggccgaggt ctggaagccc accctaggac ttgaatggca ccttgtcctt 60 
tctctgccag taatgcaatc caacacaata tgctacaggg aaaacagaat ttccacggtg 120 
ccgccctctg gtacaaggga aacagcacgc aaagcaaaag gccacagagg gctccctgag 180 
aatccagtac aactaagcga ggacctgccc gggcggccgc tcg 223 



<210> 97 

<211> 527 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 404, 436, 451, 476 
<223> n = A,T,C or G 



<400> 97 

tcgagcggcc gcccgggcag gtctgtgcag gagacactga agtgggtagt gtccataatc 60 
tttttagcct gttgctgaaa ttccagttgt actccttcaa accaaaatgc ttacaggatc 120 
atgggaaagc ctcggttgca gaaatcaaga caggcaagtg ggaagataac tcggctttga 180 
ggttaaacag atctgggttc aaagcatagt ttcactctct gtcttgtgaa gtgtcctggg 240 
tgaagtcatt tcctctcttg aatttcagag aggatgaaaa tataaaaagt ataataacta 300 
tcttcataat ctttgtgagg attaaagaag acgaagtgtg tgaaaagcta agcacagagc 360 
aggcattcta caataagtag ttattatttt tggaaccatc ccgnccctag ccccagccca 420 
attaccttct cttagnctct tcatatcgaa ngccgtaatc ttgaccttct cttgcnactg 480 
gattggtgct ggttgatgcc caaacttccc gagatgctgt ctgggaa 527 

<210> 98 
<211> 514 
<212> DNA 
<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 455 

<223> n = A,T,C or G 



<400> 98 

tcgagcggcc gcccgggcag gtctggctcc catggccctt ggggtggcct gactctgtca 60 
ctattcctaa aaccttctag gacatctgct ccaggaagaa ctttcaacac caaaattcat 120 
ctcaatttta cagatgggaa aagtgattct gagaccagac cagggtcagg ccaaggtcat 180 
ccagcatcag tggctgggct gagactgggc ccagggaacc ctgtctgctc ctctttttcc 240 
cagagctgtg agttctctag ccaaggctgc actcttgagg gagagccagg aagcatagct 300 
gaggccatga caacctcact cttcacctga aaatttaacc cgtggcagag gatccaggca 360 
catataggct tcggagccaa acaggacctc ggccgcgacc acgctaagcc gaattccagc 420 
acactggcgg ccgttactag tggatcccga gcttnggtac caagcttggc gtaatcatgg 480 
gcatagctgg ttcctggggt gaaaatggta tccg 514 



<210> 99 

<211> 530 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 430, 522 
<223> n = A,T,C or G 

<400> 99 

tcgagcggcc gcccgggcag gtctgaagaa acaggtataa atttggcagc cagtaatttt 60 
gacagggaag ttacagcttg catgacttta aatatgtaaa tttgaaaata ctgaatttcg 120 
agtaatcatt gtgctttgtg ttgatctgaa aaatataaca ctggctgtcg aagaagcatg 180 
ttcaaaaata tttaattcac ttcaaaatgt catacaaatt atggtggttt ctatgcaccc 240 
ctaaagcttc aagtcattta gctcaggtac atactaaagt aatatattaa ttcttccagt 300 
acagtggtgt ttcataccat tgacatttgc ataccctaga ataatttaag aaagacatgt 360 
gtaatattca caatgttcag aaaagcaagc aaaaggtcaa ggaacctgct ttggttcttc 420 
tggagatggn ctcatatcag cttcataaac attcattcta caaaatagta agctaaccat 480 
ttgaacccca atttccagat taagcatatt ttctcataaa tnatgaagcc 530 

<210> 100 
<211> 529 
<212> DNA 
<213> Homo sapiens 



<400> 100 

agcgtggtcg cggccgaggt ccaggcacgg tggcttatgt gtgtaatccc agcacttggg 60 
gaggctgagg gaggtggatc acttgagtcc aggagtttga gaccagtctg ggcaacatgg 120 
cgaaacttca tcactaccaa agaagaaaaa aattagccag gtgtggtggt gtatgcctgt 180 
agtcccagat actctggtgg ctgaggtgag aggatagctt gagcccagga aattgaggct 240 
gcagtgaact atgattgcac tactgtgctc cagcttgggc aacagagtga gatcttgtct 300 
ccaaaagtcc ttgaaggatt ttaggaagtt gttaaaagtc ttgaaacgat gtttgggggc 360 
atgttagggt tcttgaatgt ttaattcctc taataactgc ttattcaaga gaagcatttc 420 
tgactgggtg cggggcagtg gcttcatgcc ccataatccc agtactttgg gaggctgaag 480 
caggaacatt gcttgagccc aggacttcaa gaacagcctg ggtaacata 529 

<210> 101 
<211> 277 
<212> DNA 
<213> Homo sapiens 

<400> 101 

tcgagcggcc gcccgggcag gtcgcaggaa gaggatggaa actgaggagt ccaggaagaa 60 
gagggaacga gatcttgagc tggaaatggg agatgattat attttggatc ttcagaagta 120 
ctgggattta atgaatttgt ctgaaaaaca tgataagata ccagaaatct gggaaggcca 180 
taatatagct gattatattg atccagccat catgaagaaa ttggaagaat tagaaaaaga 240 
agaagagctg agaacagacc tcggccgcga ccacgct 277 

<210> 102 
<211> 490 
<212> DNA 
<213> Homo sapiens 



<400> 102 

gcgtggtcgc ggccgaggtc tgacggcttt gctgtcccag agccgcctaa acgcaagaaa 60 
agtcgatggg acagttagag gggatgtgct aaagcgtgaa atcagttgtc cttaattttt 120 
agaaagattt tggtaactag gtgtctcagg gctgggttgg ggtccaaagt gtaaggaccc 180 
cctgccctta gtggagagct ggagcttgga gacattaccc cttcatcaga aggaattttc 240 
ggatgttttc ttgggaagct gttttggtcc ttggaagcag tgagagctgg gaagcttctt 300 
ttggctctag gtgagttgtc atgtgggtaa gttgaggtta tcttgggata aagggtcttc 360 
tagggcacaa aactcactct aggtttatat tgtatgtagc ttatattttt tactaaggtg 420 
tcaccttata agcatctata aattgacttc tttttcttag ttgtatgacc tgccccgggc 480 
ggccgctcga 490 

<210> 103 
<211> 490 
<212> DNA 
<213> Homo sapiens 



<400> 103 

gagcggccgc ccgggcaggt ccaaaccagc ttgctcataa gtcattaacc aaatccatta 60 
taggtaattt gttcagttca atgtttacaa ttcttatgga aaaaattagc aacacacaca 120 
tttaaaacgt gtgcatttac ctttgcgtga gtgcttaaaa tacatatttc tatttcaaga 180 
tgacatttaa aaattattct aatatatcag cagcaaaaat ataatttgca attacaaaaa 240 
actaaactag aatccttaag ttattctcat gtttacagtt gtgattcttt aataaatact 300 
attatgcagc tctattgttt aagctttctg gatttggttt aaacacatgc atatatattg 360 
tcaattgtgg gaagctttac aagttatatt ccatgcactt tttggacaga gttctaacag 420 
agccagccag tccacaaaac aggcaagaca aaagttgaat taactggggc aaaataggac 480 
tcttatgcaa 490 

<210> 104 
<211> 489 
<212> DNA 
<213> Homo sapiens 



<400> 104 

cgtggtcgcg gccgaggtcc aggctggtct cgaactcctg accttgtgat ctgcccgcct 60 
cggcctccca aagtgttggg attacaggca tgagccactg cgcccgaccg agttgaacat 120 
ttaatgtcag actaggccag agtttctcaa tctttttatt ctcacttccc aaaggagccg 180 
ttggagattt tcccctcaat ctctctcctt catgaaattt cataccacaa atatagtatg 240 
ttttatttat gtactgtgac cctttgaagg atcacaaacc aatataatag tttttctttt 300 
taacccgtca aggaccaagt ttttgcccct gttggaaatg cataaactgg actgatgaat 360 
tggtatagat ggcttttatc atgaggatca gaaaaacttg aaattccttg gctacgacac 420 
tccatattta tcaccgtata gggaggacct tggtatgggg aagtagaaac acttctacac 480 
tttacagca 489 



<210> 105 
<211> 479 
<212> DNA 
<213> Homo sapiens 
<220> 
<221> misc_feature 
<222> 142, 453 
<223> n = A,T,C or G 



<400> 105 

gcgtggtcgc ggccgaggtc tgactggctt cagccccaga agttgagctg gcctttagac 60 
aaaataattg cacctccctc tgctgcttat tcccttccgt ttttcatttg agtgtgaaca 120 
gttagataaa atctgtggct gnctcttcca ccttgctcta gtttccattg ctgtgagcag 180 
gccctcctat gccccgcatt tagctacaat gctgtggact cacttgattc tttttctccg 240 
agctttgtct agaaatatgt gaaggtgagg ttaagtgctt ctctgtgtag atccacttag 300 
ccctgtctgc tgtctcgatg ggcgttgctt cgtctctcct ctcttccatc ctttccattt 360 
gcttctcacc accttctggc ttcttttctt aatgcaataa aggcagtttc taacaaagaa 420 
agaatgtggg ctttggagtt agacagacct ggntttaaat tctgcttctg gctctccaa 479 

<210> 106 
<211> 511 
<212> DNA 
<213> Homo sapiens 



<400> 106 

tcgcggccga ggtccaaaac gtggattcca atgacctgcc ttgagcccgc ggttgccagg 60 
agttggacct gcagtagtat gggaagctca cggcctaaat accgactgcc ctctgacccc 120 
accgtccagc gattctagaa catttctagt aggaaagaca tagcaaggga ttttcatgat 180 
tgggaaatac tgggagacaa gctgaagatt tgttaagggc tatgcttctg tcatctttta 240 
ggtatttaag gctactcctt tagctagcta ctttgagctg tttaaagtga ctatctccct 300 
acacagagtt acacaatgag catctctgaa agagaatatt accctggatt tccaaagatg 360 
tactctaaca ggatgaccag gcaaaaggtg acccggggga ggagtctgtt ataacactcg 420 
gacccacatg ttctcaaggc acttcagaac tttgggaaat cattttgtac cggatcctca 480 
gaaagcattt atggaaatac acatccttta g 511 

<210> 107 
<211> 451 
<212> DNA 
<213> Homo sapiens 



<400> 107 

ggccgcccgg gcaggtccag aatatcaaat caaaaggtca caaatgttca cttcctcctc 60 
caccctctta catattggat cttcaattgc aatagggagt gtaagatggg cattttagag 120 
acgtagttgc atcagcagaa gcaaacccat cttatacaaa tgggttttgg ggataggaaa 180 
aggctgctaa aaattcacaa gtcaccattc cccagaagca atgaatagcc gtagaagacc 240 
aaggaagatc aacaagtttc caaagtgcta aagccagaga tttggccctt ccaaaatacc 300 
accaggacgc ctggacccgt gggctctccg catgtcacca ctgactgcca ggatgctgct 360 
gcacctccct tccttgagac acaacagaga gacagtgaag tcacccaaga ctgggatcat 420 
cagaggctcc tcatgcttgc tacagagaag c 451 

<210> 108 
<211> 461 
<212> DNA 
<213> Homo sapiens 



<400> 108 

ccgcccgggc aggtcctgaa aacattcaga ctaatcaaaa tggtactact gtaacttctt 60 
ataatacata atataaaagt ttttgaaaga tatagacaca attaacccct aaacaacaca 120 
ctatctgatt ctcaaaagca atggctattt aacaagatgt aaaaggacaa taacatatca 180 
aagaactttc acacacctaa agatagcatt tagcagcaag ttagtcagac aaaacaaaca 240 
caaatatttt cacatttcct atgtttgttt ttaactttac ttcataaagc cactgataat 300 
tgaggtttct ttcaagtata agatttctaa aattaaaaac tgtttttgac atatttttat 360 
aaagaaataa aaagcaaaac gcaatccaac tatttatatg agtccctctt ctccaacagc 420 
tttagatggt tttctgagta cttttttaca cagaatattt t 461 

<210> 109 
<211> 441 
<212> DNA 
<213> Homo sapiens 



<400> 109

ggccgcccgg gcaggtctga ttataagaga aagaaatcca gtgacacgag ggcaggcagg 60
ccccgctctg ctctgatcga gaaaagcttc ctgatgtcag ggagatggaa ctgccaccat 120
cagaaccatg gcactttggg tgaaggtgtg tcagcgacca agggggcagg aaatgggcag 180
tgactaaggg ggcaggaaac aggcaggcac atggcaaggt tctcccagcc catcagccca 240
gtgatggcct cgattttgaa gctgcactac tgtctgaaaa gcacaattac tggtgactct 300
taacaaactt cagcatactg gggaaggaga ctgtcaagta actgaattgg aaagatgaaa 360
aagaaccatc tctaaaagtt gatgcttgtc agaagaataa cctcctttgt gcaagtcttg 420
caacatcttc attcaaccac a 441



<210> 110 
<211> 451 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 260, 361 
<223> n = A,T,C or G



<400> 110 

ggtcgcggcc gaggtctggg gaaggggtga gaatccctgg gccttgccca gtcctgagct 60 
ctgggtgtct gcagggaagc acagtggtga gttagtgtta aagaaagcat ccagagaggt 120 
aagaggggct tgggtagcac cctttgcctc tgtcacttcc gcaaaaactt cttgttgagg 180 
aggaagatga gaaggttgac attgactttg gccttgttga agagtttcat gacagccaca 240 
ccctcatact ggagctgcan gagatcctga tagtgaagct tgaaatcgct ccatgtccac 300 
acccaggaac ttggcattta cttcaaactt tcctgcctca tctcccggcg tgatgtcaaa 360 
natgacgttt cttgaagtga gaggcgggaa agatcttcaa tttccaccaa agacaccctt 420
tttccaggaa gcttgagcaa caagtgtaat g 451 

<210> 111 
<211> 407 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<222> 26, 33, 36, 79, 105, 111, 133, 149, 186, 206, 220, 239, 245, 
259, 336, 375, 383, 393 
<223> n = A,T,C or G 



<400> 111

ggccgacgtt cgacctgact tctttngagc agntgncact acccgtcttg aggaatgccg 60
actgcagaca gtggcccang gcaaagagtg tgcgtcatcg atganattgg naagatggag 120
ctcttcagtc agnttttcat tcaagctgnt cgtcagacgc tgtctacccc agggactata 180
atcctnggca caatcccagt tcctanagga aagccactgn ctcttgtaga agaaatctna 240
cacanaaagg atgtgaacng tgtttaatgt caccaaggga aaacatgaaa ccaccttctg 300
ccagatatcg ggacgttgcg tgcagatcaa gcacgnaagt gaagacgcgt gcattccttg 360
ccttccgtga acgantgccc agntcaagaa gancctgatg gaaccct 407

<210> 112 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 363 

<223> n = A,T,C or G



<400> 112

tcgcggccga ggtcggccga ggtctgacat ctgttgtctg tgataaccac ttctgtattg 60
cgtcttaacc acttctgtat tgtgtggttt taactgccta aggcggcaat gggcagtggg 120
cccctttccc ttaggatggg tatcaattca acaatattta taaggcattt actgtgtgct 180
aagcatttgg aagacccagg ctacaaaata agacatagtt cctgccctcc aggccagcag 240
agggaggcac aaatacccag gaatctctga tgggtgtgaa gtgcggtcgt gggccacaga 300
aaatgaccgt catggagacc ctgctaaagg tcggaccctg agcccaaagg ggtattcaga 360
agnggagatg attttggccc cactcataga tgggtggcaa a 401

<210> 113
<211> 451
<212> DNA
<213> Homo sapiens



<400> 113

gtcgcggccg aggtccatat taaaaagtcc atcataaaca aagactcctc ctcatggtat 60
gaatatgctc catatgccca taatggtgca taacggactt agaaattcca atgagtctta 120
gggttgaaat ttccaatgac ctgagcaagg cagctcccta tagcttctgg ataacatttt 180
acacccagag ttcaggctta aacagaccta tcaacacaat tattttcgga ttgtctgtct 240
agaaaacggc aatgctcaaa ggaatataaa taagggtggg gggacatatg cttccagcct 300
ggcctttctc catgtggtaa aaaacaatgg aatggctgtg ttaatttttt tttaatcttt 360
tctgaccttt actatgtttg gtaatggaaa taagtcaggg aaaacaaaat gaacaggtct 420
catcacttaa ttaatactgg gttttcttct t 451

<210> 114
<211> 441
<212> DNA
<213> Homo sapiens



<400> 114

ggccgcccgg gcaggtccat cctgtcagag atgggagaag tcacagacgg aatgatggat 60
acaaagatgg ttcactttct tacacactat gctgacaaga ttgaatctgt tcatttttca 120
gaccagttct ctggtccaaa aattatgcaa gaggaaggtc agcctttaaa gctacctgac 180
actaagagga cactgttgtt tacatttaat gtgcctggct caggtaacac ttacccaaag 240
gatatggagg cactgctacc cctgatgaac atggtgattt attctattga taaagccaaa 300
aagttccgac tcaacagaga aggcaaacaa aaagcagata agaaccgtgc ccgagtagaa 360
gagaacttct tgaaacttga cacatgtgca aagacaggaa gcagcacagt ctcggcggga 420
ggaagaaaaa aagaacagag a 441

<210> 115
<211> 431
<212> DNA
<213> Homo sapiens



<220> 
<221> misc_feature
<222> 317
<223> n = A,T,C or G



<400> 115

gccgcccggg caggtccatt ggcggtgaca aaaggaaaag aagcaaagag actcagtcca 60
taatgctgat tagttagaag aaagggctag gattgagaaa gtaccaggaa cttttaatta 120
tttaaaagag aatgctgact gttaatgttt taaatcttac tgttcaaatg tactaatatg 180
aatttttacc ctttgtgcat gaatattcta aacaactaga agacctccac aatttagcag 240
ttatgaaagt taaacttttt attataaaaa ttctaaacct tactgctcct ttaccaggaa 300
catgacacac tatttancat cagttgcata cctcgccaat agtataattc aactgtcttg 360
cccgaacaat catctccatc tggaagacgt aagcctttag aaacacattt ttctattaat 420
ttctctagaa c 431



<210> 116 
<211> 421 
<212> DNA 
<213> Homo sapiens 



<400> 116

gtcgcggccg aggtccagaa atgaagaaga agtttgcaga tgtatttgca aagaagacga 60
aggcagagtg gtgtcaaatc tttgacggca cagatgcctg tgtgactccg gttctgactt 120
ttgaggaggt tgttcatcat gatcacaaca aggaaccggg gctcgtttat caccagtgag 180
gagcaggacg tgagcccccg ccctgcacct ctgctgttaa acaccccagc catcccttct 240
ttcaaaaggg atcctttcat aggagaacac actgaggaga tacttgaaga atttggattc 300
agcccgcgaa gagatttatc aagcttaact cagataaaat cattgaaagt aataaggtaa 360
aagctaagtc tctaacttcc aggcccacgg ctcaagtgaa tttcgaatac tgcatttaca 420
421



<210> 117 
<211> 489 
<212> DNA 
<213> Homo sapiens 



<400> 117

agcgtggtcg cggccgaggt aaggctgcga ggttgtggtg tctgggaaac tccgaggaca 60
gagggctaaa tccatgaagt ttgtggatgg cctgatgatc cacagcggag accctgttaa 120
ctactacgtt gacactgctg tgcgccacgt gttgctcaga cagggtgtgc tgggcatcaa 180
ggtgaagatc atgctgccct gggacccaac tggtaagatt ggccctaaga agcccctgcc 240
tgaccacgtg agcattgtgg aacccaaaga tgagatactg cccaccaccc ccatctcaga 300
acagaagggt gggaagccag agccgcctgc catgccccag ccagtcccca cagcataaca 360
gggtctcctt ggcagacctg cccgggcggc cgctcgaaag cccgaattcc agcacactgg 420
cggccgttac tagtggatcc cagctcggta ccaagcttgg cgtaatcatg gtcatagctg 480
gtttcctgt 489

<210> 118
<211> 489
<212> DNA
<213> Homo sapiens



<400> 118

tcgagcggcc gcccgggcag gtattgaata cagcaaaatt ctatatacaa agtgacctgg 60
acctgctgct tcaaaacatg atcctttctt actaatatct tgatagtcgg tccatagagc 120
attagaaagc aattgactct taaataaaca gaaaagtgcc taatgcacat taaatgaatg 180
gcctaactac tggaacttta gtagttctat aaggtgatta acataggtag gatccagttc 240
ctatgacagg ctgctgaaga acagatatga gcatcaagag gccattttgt gcactgccac 300
cgtgatgcca tcgtgtttct ggatcataat gttcccatta tctgattcta gacacaccac 360
aggaatatca gtggggtcag aggttagctt agctgcttgc tgggctagaa cagatatcac 420
tccagcatgc tcatctgaca gggtcccgcg gcaacccaga ttaagtcctt gtgaatctgt 480
gcacaggga 489

<210> 119 
<211> 181 
<212> DNA 
<213> Homo sapiens 

<400> 119 

taggttccag agacttttgg cccaggagga atatttactt ttagctctgg acatcattac 60 

aaaaaggaat atttcccaaa cctcttcaga ccgagaatac atgggtaaaa ttattaaata 120 

gttgtataat aaaaataatt ttttccttaa aaaaaaaaaa aacctcggcc gcgaccacgc 180 
t 181 



<210> 120 
<211> 489 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 422, 487 
<223> n = A,T,C or G 



<400> 120

gcgtggtcgc ggccgaggtc catttaaaac aaagaaaaat actaaagcca ctagtaaaca 60
tctgatgtgc aaaatacaac atcctctagt tggctttatg ccattattac ataagctcca 120
aatagctcat cttaaattaa aaagaaaaag tggctgtccc atctctgctg cataaatcag 180
attttttttt aaaggtttag agtactttaa ggaagggaag ttcaaaactg ccagtgaaat 240
tcacagagaa tacaaattta gcaatttaat ttcccaaagc tctttgaaga agcaagagag 300
tctctcttct taatgcagtg ttctcccaag aggaactgta attttgcttg gtacttatgc 360
tgggagatat gcaaaatgtg tttttcaatg tttgctagaa tataatggtt cctcttcagt 420
gnctggttca tcctggaact catgggttaa gaaggacttc ttggagccga actgcccggg 480
cgggccntt 489

<210> 121
<211> 531
<212> DNA
<213> Homo sapiens



<400> 121

cgagcggccg cccgggcagg tggccagcgc tggtcccgca gacgccgaga tggaggaaat 60
atttgatgat gcgtcacctg gaaagcaaaa ggaaatccaa gaaccagatc ctacctatga 120
agaaaaaatg caaactgacc gggcaaatag attcgagtat ttattaaagc agacagaact 180
ttttgcacat ttcattcaac ctgctgctca gaagactcca acttcacctt tgaagatgaa 240
accagggcgc ccacgaataa aaaaagatga gaagcagaac ttactatccg ttggcgatta 300
ccgacaccgt agaacagagc aagaggagga tgaagagcta ttaacagaaa gctccaaagc 360
aaccaatgtt tgcactcgat ttgaagactc tccatcgtat gtaaaatggg gtaaactgag 420
agattatcag gtcccgagga ttaaactggc tcatttcttt gtatgagaat ggcatcaatg 480
gtatccttgc agatgaaatg ggcctaggaa agactcttca acaatttctc t 531

<210> 122
<211> 174
<212> DNA
<213> Homo sapiens



<400> 122 

tcgagcggcc gcccgggcag gtctgccaac agcagaggcg gggcctccgg catcttcaaa 60 
gcacctctga gcaggctcca gccctctggc tgcgggaggg gtctggggtc tcctctgagc 120
tcggcagcaa agcagatgtt atttctctcc cgcgacctcg gccgcgacca cgct 174 

<210> 123 

<211> 531 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<222> 152, 373, 482, 494, 496, 502 

<223> n = A,T,C or G 

<400> 123 

agcgtggtcg cggccgaggt cctcaaccaa gagggttgat ggcctccagt caagaaactg 60 
tggctcatgc cagcagagct ctctcctcgt ccagcaggcg ccatgcaagg gcaggctaaa 120 
agacctccag tgcatcaaca tccatctagc anagagaaaa ggggcactga agcagctatg 180 
tctgccaggg gctaggggct cccttgcaga cagcaatgct acaataaagg acacagaaat 240 
gggggaggtg ggggaagccc tatttttata acaaagtcaa acagatctgt gccgttcatt 300
cccccagaca cacaagtaga aaaaaaccaa tgcttgtggt ttctgccaag atggaatatt 360 
cctccttcct aanttccaca catggccgtt tgcaatgctc gacagcattg cactgggctg 420 
cttgtctctg tggtctgggc accagtagct tgggccccat atacacttct cagttcccac 480
anggcttatg gccnangggc angctccaat tttcaagcac cacgaaggaa g 531 

<210> 124 
<211> 416 
<212> DNA 
<213> Homo sapiens 

<400> 124 

tcgagcggcc gcccgggcag gtccatctat actttctaga gcagtaaatc tcataaattc 60 

acttaccaag cccaggaata atgactttta aagccttgaa tatcaactaa gacaaattat 120 

gccaattctg atttctcaca tatacttaga ttacacaaag ataaagcttt agatgtgatc 180 

attgtttaat gtagacttat ctttaaagtt tttaattaaa aactacagaa gggagtaaac 240

agcaagccaa atgatttaac caaatgattt aagagtaaaa ctcactcaga aagcattata 300 

cgtaactaaa tatacatgag catgattata tacatacatg aaactgcaat tttatggcat 360 

tctaagtaac tcatttaagt acatttttgg catttaaaca aagatcaaat caagct 416 

<210> 125 
<211> 199 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 112, 160, 195 
<223> n = A,T,C or G 

<400> 125 

agcgtggtcg cggccgaggt gctttttttt tttttttttt tttttttttt gctattctaa 60 
aggggaaggc ccctttttat taaacttgta cattttactt tccttctttc anaatgctaa 120 
taaaaaactt ttgtttatac ttaaaaaaac cataaatcan acaaacaaaa gaaacgattc 180 
caacatcact tctgngatg 199 

<210> 126 
<211> 490 
<212> DNA 
<213> Homo sapiens 
<400> 126

cgtggtcgcg gccgaggtcc agttgctcta agtggattgg atatggttgg agtggcacag 60
actggatctg ggaaaacatt gtcttatttg cttcctgcca ttgtccacat caatcatcag 120
ccattcctag agagaggcga tgggcctatt tgtttggtgc tggcaccaac tcgggaactg 180
gcccaacagg tgcagcaagt agctgctgaa tattgtagag catgtcgctt gaagtctact 240
tgtatctacg gtggtgctcc taagggacca caaatacgtg atttggagag aggtgtggaa 300
atctgtattg caacacctgg aagactgatt gactttttag agtgtggaaa aaccaatctg 360
agaagaacaa cctaccttgt ccttgatgaa gcagatagaa tgcttgatat gggctttgaa 420
ccccaaataa ggaagattgt ggatcaaata agacctgata ggcaaactct aatgtggagt 480
gcgacttggc 490

<210> 127
<211> 490
<212> DNA
<213> Homo sapiens



<400> 127

cgtggtcgcg gccgaggtcg gccgaggtct ggagatctga gaacgggcag actgcctcct 60
caagtgggtc cctgacccct gacccccgag cagcctaact gggaggcacc ccccagcagg 120
ggcacactga cacctcacac ggcagggtat tccaacagac ctgaagctga gggtcctgtc 180
tgttagaagg aaaactaaca agcagaaagg acagccacat caaaaaccca tctgtacatc 240
accatcatca aagaccaaaa gtaaataaaa ccacaaagat gggaaaaaaa cagaacagaa 300
aaactggaaa ctctaaaaag cagagcacct ctcctcttcc aaaggaacgc agttcctcac 360
cagcaatgga acaaagctgg atggagaatg actttgacga gctgagaaaa gaacgcttca 420
gacgatcaaa ttactctgag ctacgggagg acattcaaac caaaggcaaa gaagttgaaa 480
actttgaaaa 490

<210> 128
<211> 469
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature

<222> 69, 106, 140, 152, 165, 196, 224, 233, 241, 258, 260, 267, 

291, 347, 395 

<223> n = A,T,C or G 

<400> 128 

cgtggtcgcg gccgaggtgc tttttttttt tttttttttt tttttttttt tgctgattta 60 

ttttttctnt ttattgttac atacaatgta taaacacata aaacanaaaa cagtagggat 120 

cctctaggat ctctagggan acagtaaagt anaaagaggt ctcanaaaca tttttttaaa 180 

gtacaagaca ttcagngctc ggcccaaagg cgtaaaaggt ttanagccag canatagctg 240 

nactaaaggc tccgtctntn tccccanagc caggacaacc ccagggagct ntccattagc 300 

agccagtcca cgcaggcagg atgctgcgga aaaagctcta tgctganaac attccccttg 360 

atggaaagaa gggcaacaca aaaggggtaa ctaanagctc cttcctctcg tgagggcgac 420 

aactgaggaa cagaaaagga gtgtcccatg tcacttttga ccccctccc 469 

<210> 129 
<211> 419 
<212> DNA 

<213> Homo sapiens 



<400> 129

gcgtggtcgc ggccgaggtc tgattttcat ttaaatattt cagagctata gcatttgcct 60
ccatgctcaa atccacacca ttggggctta agccgctcat gccaacatta gcaaatgaca 120
tgcagtttaa tccagagatc actgcttctg ggctgatgca tgccaacaca ctggcgtgat 180
ccacgttatg tgcatttttc ttcactttag tgggagaatc aatttttact ccaaggcttc 240
ttagttgctt aagagttgca ttaaggacac aatctttgtc caccagtctt gaatgatgtg 300
tttttttctt tgtatggtaa acgttttggg ttctggtgca ttcatgactg ataattactg 360
ctttggtaga cggctgctca agtttccttg gaggaactat ttaataggtg ggttacttg 419



<210> 130
<211> 354
<212> DNA
<213> Homo sapiens

<400> 130

agcgtggtcg cggccgaggt ccatctgagg agataaccac atcactaaca aagtgggagt 60
gaccccgcag agcacgctgt ggaattccat agttggtctc atccctggtc agtttccaca 120
tgatgatggt cttatctcga gaggcggaga ggatcatgtc cgggaactgc ggggtagtag 180
cgatctgggt tacccagccg ttgtggccct tgagggtgcc acgaagggtc atctgctcag 240
tcatggcggc ggcgagagcg tgtgtcgctg cagcgacgag gatggcactg gatggcttag 300
agaaactagc accacaacct ctcctgccgc acctgcccgg gcggcccgct cgaa 354

<210> 131 
<211> 474 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 421 

<223> n = A,T,C or G 
<400> 131

cgagcggccg cccgggcagg tctggcagca gcttcctctg gaataattga cagctttgtg 60
ctgcctgact aaaatttgaa atgacaaccg ctgaatgtaa aatgatgtac ctacaatgag 120
agagatttag gaatactatc tgtcaatcca tagatgtaga aacaaaacaa actacagaat 180
gaaaacaaac ttattttaaa ccaaagaaac aaatgtatcc aaaatatagt ccatgatata 240
tttgattact agtataacca cagttgaaaa cttaaaaaaa aaaattgaca ttttttgtaa 300
tgggtactaa tggatttata aaaggtttct gtttccaaag atgttattgg ggtccacata 360
ttccttgaag acttcagcat cccaaagccc gacatcagag atactttcct ttagccattg 420
nttcccgtaa cttgcccact ccatggtgat gtgacaggct tcccttcatt agca 474

<210> 132
<211> 474
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature
<222> 403

<223> n = A,T,C or G
<400> 132 

ggccgaggtg gggaattcat gtggaggtca gagtggaagc aggtgtgaga gggtccagca 60 
gaaggaaaca tggctgccaa agtgtttgag tccattggca agtttggcct ggccttagct 120 
gttgcaggag gcgtggtgaa ctctgcctta tataatgtgg atgctgggca cagagctgtc 180 
atctttgacc gattccgtgg agtgcaggac attgtggtag gggaagggac tcattttctc 240 
atcccgtggg tacagaaacc aattatcttt gactgccgtt ctcgaccacg taatgtgcca 300 
gtcatcactg gtagcaaaga tttacagaat gtcaacatca cactgcgcat cctcttccgg 360 
cctgtcgcca gccagcttcc tcgcatcttc accagcatcg ganaggacta tgatgaaccg 420 
tgtgctgccg tccatcacaa ctgagatcct caagtcagtg gtggctcgct ttga 474 
<210> 133
<211> 387 
<212> DNA 
<213> Homo sapiens 



<400> 133

tgctcgagcg gccgccagtg tgatggatat ctgcagaatt cggcttagcg tggtcgcggc 60
cgaggtctgc gggcccctta gcctgccctg cttccaagcg acggccatcc cagtagggga 120
ctttcccaca ctgtgccttt acgatcagcg tgacagagta gaagctggag tgcctcacca 180
cacggcccgg aaacagcggg aagtaactgg aaagagcttt aggacagctt agatgccgag 240
tgggcgaatg ccagaccaat gatacccaga gctacctgcc gccaacttgt tgagatgtgt 300
gtttgactgt gagagagtgt gtgtttgtgt gtgtgttttg ccatgaactg tggccccagt 360
gtatagtgtt tcagtggggg agaactg 387

<210> 134
<211> 401
<212> DNA
<213> Homo sapiens



<400> 134

ggccgcccgg gcaggtctga tgaagaacac gggtgtgatc cttgccaatg acgccaatgc 60
tgagcggctc aagagtgttg tgggcaactt gcatcggctg ggagtcacca acaccattat 120
cagccactat gatgggcgcc agttccccaa ggtggtgggg ggctttgacc gagtactgct 180
ggatgctccc tgcagtggca ctggggtcat ctccaaggat ccagccgtga agactaacaa 240
ggatgagaag gacatcctgc gcttgtgctc acctccagaa ggaagttgct cctgagtgct 300
attgactctt gtcaatgcga ccttcaagac aggaggctac ctggtttact gcacctgttc 360
tatcacagtg agacctctgc catggcagaa caggggaagc t 401



<210> 135 
<211> 451 
<212> DNA 
<213> Homo sapiens 



<400> 135

ggtcgcggcc gaggtctgtt cctgagaaca gcctgcattg gaatctacag agaggacaac 60
taatgtgagt gaggaagtga ctgtatgtgg actgtggaga aagtaagtca cgtgggccct 120
tgaggacctg g^ctgggtta ggaacagttg tactttcaga ggtgaggtgt cgagaaggga 180
aagtgaatgt ggtctggagt gtgtccttgg ccttggctcc acagggtgtg ctttcctctg 240
gggccgtcag ggagctcatc ccttgtgttc tgccagggtg gggtaccggg gtttgacact 300
gaggagggta acctgctggc tggagcggca gaacagtggc cttgatttgt cttttggaag 360
attttaaaaa ccaaaaagca taaacattct ggtccttcac aatgctttct ctgaagaaat 420
acttaacgga aggacttctc cattcaccat t 451

<210> 136
<211> 411
<212> DNA
<213> Homo sapiens



<400> 136

ggccgcccgg gcaggtctga atcacgtaga atttgaagat caagatgatg aagccagagt 60
tcagtatgag ggttttcgac ctgggatgta tgtccgcgtt gagattgaaa atgttccctg 120
tgaatttgtg cagaactttg acccccttta ccccattatc ctgggtggct tgggcaacag 180
tgagggaaat gttggacatg tgcaggtggg tccctttgct gcgtatttgg tgcctgaggc 240
tctgtggatt tcccctccat caatcatctt accctctcat ccccctcaga tgcgtctgaa 300
gaaacatctc tggtataaga aaatcctcaa gtcccaagat ccaatcatat tttctgtagg 360
gtggaggaag tttcagacca tcctgctcta ttatatccga agaccacaat g 411



<210> 137 
<211> 211
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature
<222> 186 

<223> n = A,T,C or G 



<400> 137

cggccgcccg ggcaggtcgg ttggtgcggc ctccattgtt cgtgttttaa ggcgccatga 60
ggggtgacag aggccgtggt cgtggtgggc gctttggttc cagaggaggc ccaggaggag 120
ggttcaggcc ctttgcacca catatcccat ttgacttcta tttgtgtgaa atggcctttc 180
cccggntcaa gccagcacct cgatgaaact t 211



<210> 138 
<211> 471 
<212> DNA 

<213> Homo sapiens 
<400> 138

gccgcccggg caggtctggg ctggcgactg gcatccaggc cgtaactgca aatctatgct 60
aggcggggtc tcccttctgt gtgttcaagt gttctcgact tggattctta actattttaa 120
aaaatgcact gagtttgggt taaaaaccaa ccaccaaaat ggatttcaac acagctctaa 180
agccaagggc gtggccggct ctcccaacac agcgactcct ggaggccagg tgcccatggg 240
cctacatccc ctctcagcac tgaacagtga gttgattttt ctttttacaa taaaaaaagc 300
tgagtaatat tgcataggag taccaagaaa ctgcctcatt ggaaacaaaa actatttaca 360
ttaaataaaa agcctggccg caggctgcgt ctgccacatt tacagcacgg tgcgatgcac 420
acggtgacca aaccacggag gcaagcttct ggcactcaca ccacgacccg c 471

<210> 139
<211> 481
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature 
<222> 384 

<223> n = A,T,C or G 



<400> 139

gtcgcggccg aggtctgttc tttagctcag atttaaacct gctgtctctt ctttatttgc 60
agaatgaatt cccagttcct gagcagttca agaccctatg gaacgggcag aagttggtca 120
ccacagtgac agaaattgct ggataagcga agtgccactg ggttctttgc cctcccttca 180
caccatggga taaatctgta tcaagacggt tcttttctag atttcctcta cctttttgct 240
cttaaaactg cttctctgct ctgagaagca cagctacctg ccttcactga aatatacctc 300
aggctgaaat ttggggtggg atagcaggtc agttgatctt ctgcaggaag gtgcagcttt 360
tccatatcag ctcaaccacg ccgncagtcc attcttaagg aactgccgac taggactgat 420
gatgcatttt agcttttgag cttttggggg gtattctacc aaccaacagt ccatttggaa 480
481



<210> 140 
<211> 421 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<222> 372
<223> n = A,T,C or G



<400> 140 

gtcgcggccg aggtttccca tttaagaaaa atagatcttg agattctgat tcttttccaa 60 

acagtcccct gctttcatgt acagcttttt ctttacctta cccaaaattc tggccttgaa 120 

gcagttttcc tctatggctt tgcctttctg attttctcag aggctcgagt ctttaatata 180 

accccaaatg aaagaaccaa ggggaggggt gggatggcac ttttttttgt tggtcttgtt 240 

ttgttttgtt ttttggttgg ttgggttccg ttatttttta agattagcca ttctctgctg 300 

ctatttccct acataatgtc aatttttaac cataattttg acatgattga gatgtacttg 360 

aggctttttt gntttaattg agaaaagact ttgcaatttt ttttttagga tgagcctctc 420 
c 421 

<210> 141 

<211> 242 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 4, 6, 20, 31, 35, 39, 72, 94, 141, 142, 211, 222 
<223> n = A,T,C or G



<400> 141 

cgantngccc gcccgggcan gtctgtctaa ntttntcang gaccacgaac agaaactcgt 60 
gcttcaccga anaacaatat cttaaacatc gaanaattta aatattatga aaaaaaacat 120 
tgcaaaatat aaaataaata nnaaaaggaa aggaaacttt gaaccttatg taccgagcaa 180 
atccaggtct agcaaacagt gctagtccta nattacttga tntacaacaa cacatgaata 240 

242 

<210> 142 
<211> 551 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature

<222> 15, 19, 32, 73, 110, 278, 405, 436, 473, 510 
<223> n = A,T,C or G 



<400> 142

agcgtggtcg cggcncgang tccacagggc anatattctt ttagtgtctg gaattaaaat 60
gtttgaggtt tangtttgcc attgtctttc caaaaggcca aataattcan atgtaaccac 120
accaagtgca aacctgtgct ttctatttca cgtactgttg tccatacagt tctaaataca 180
tgtgcagggg attgtagcta atgcattaca cagtcgttca gtcttctctg cagacacact 240
aagtgatcat accaacgtgt tatacactca actagaanat aataagcttt aatctgaggg 300
caagtacagt cctgacaaaa gggcaagttt gcataataga tcttcgatca attctctctc 360
caaggggccc gcaactaggc tattattcat aaaacacaac tgaanagggg attggtttta 420
ctggtaaatc atgtgntgct aaatcatttt ctgaacagtg gggtctaaat cantcattga 480
tttagtggca gccacctgcc cggcggccgn tcgaagccca attctgcaga tatccatcac 540
actggcggcc g 551

<210> 143
<211> 515
<212> DNA
<213> Homo sapiens



<220> 
<221> misc_feature
<222> 5, 286, 498 
<223> n = A,T,C or G 



<400> 143

cgagnggccc gcccgggcag gtatcttcac aaactcaaca aaggcactac atgagacttc 60
acattcccct agtccaatag ctgacaaatt tttgcaacgt tctgcaatgc gaattaactc 120
ttcatcaagt ggccgtaatc catttgcaca cactactagt tcaaccagtc tagggcatgt 180
cattcccaca cggccaagca catctttgct tactgatctc ccaaagtaca gatgggtggc 240
aggtatttca tagcgaaaga aggggtcaaa ttcttcttca tataanaaaa aatacatcac 300
taagttcact ttgggtgaat gtctgatgaa agcatcccag ctactcttct gaatagtatg 360
gaagtgtgtc tgtccaggat tctcactgac tacatcaatg cgcaaatgtt ctaatcgaac 420
atgtttttca gaagacaatg caagtaacaa ctcatcactc aataagtggt aagttcaggg 480
ctagttctct taagccgnga cactgatcag cacac 515



<210> 144 
<211> 247 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature

<222> 11, 20, 42, 115, 152, 165, 181, 195, 208, 221 
<223> n = A,T,C or G 



<400> 144

tgcattctct ntggatgcan acctgcccgt tggtagggac tntgctcaca cggaacatgg 60
acggttacac ctgtgccgtg ggtgacgtcc accagcttct ggatcatctc ggcgngggtg 120
ttgtggaagg gcagactatc cacctccatg cncacgatgc ccganacgcc actccggact 180
ntgtgctgca ccaanatgcc cagcattnta tcttcaagca nagcacttat cagggtcctt 240
ggcacac 247

<210> 145
<211> 309
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature 
<222> 18, 155, 247 
<223> n = A,T,C or G 



<400> 145

cgtgggtcgc ggcccgangt ctgctgtaac aaaacaccat agtctgggca gctcatagac 60
aatggaattt tatttctcac gcttctggag gctggattcc aagatcaagg ttccaggaga 120
ctcagtgtct ggcaaggtct cggtttctgc ctcanagatg gtgccatctg gctgtgtcct 180
cacaagtagg aaggtgcaag aagctcccct caggctctgt ctgtaagaca ctgatcccat 240
tcatganggg gaaacgtaat gacctaatca gcccccagag accccacttc taacaccatc 300
accttgggg 309

<210> 146
<211> 486
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature 

<222> 16, 97, 154, 244, 275, 322, 347, 349, 352, 357, 449, 460, 472

<223> n = A,T,C or G 



<400> 146

agcgtgggtc gcggcncgac gtcctgtcca tatttcacag cccgagaact aatacaagat 60
gctgacatca tattttgtcc ctacaactat cttctanatg cacaaataag ggaaagtatg 120
gatttaaatc tgaaagaaca ggttgtcatt ttanatgaag ctcataacat cgaggactgt 180
gctcgggaat cagcaagtta cagtgtaaca gaagttcagc ttcggtttgc tcgggatgaa 240
ctanatagta tggtcaacaa taatataagg aaganagatc atgaacccct acgagctgtg 300
tgctgtagcc tcattaattg gntagaagca aacgctgaat atcttgnana angagantat 360
gaatcagctt gtaaaatatg gagtggaaat gaaatgctct taactttaca caaaatgggt 420
atcaccactg ctacttttcc cattttgcng gtaagatatn ttttctacct gngaaacgta 480
tttaag 486



<210> 147 
<211> 430 
<212> DNA

<213> Homo sapiens 



<220> 

<221> misc_feature

<222> 13, 26, 28, 289, 299, 352, 390, 399 

<223> n = A,T,C or G 



<400> 147

gccgcccggg cangttcgac attacntnga gttccatgat gtacaattct ttcacgaaaa 60
acaatgaatg caagaatttg aggatctcct tactcctccc ttttacagat ggtctctcaa 120
tcccttcttc ttcctcttca tcttcatctt cttctgaacg cgctgccggg taccacggct 180
ttctttgtct ttatcgtgag atgaaggtga tgcttctgtt tcttctacca taactgaaga 240
aatttcgctg caagtctctt gactggctgt ttctccgact tcgcctttnt gtcaaacgng 300
agtcttttta cctcatgccc ctcagcttca cagcatcttc atctggatgt tnatttctca 360
aagggctcac tgaggaaact tctgattcan atgtcgaana gcactgtgaa gttttctctt 420
cattttgctg 430

<210> 148
<211> 483
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature 

<222> 20, 24, 53, 55, 374, 381, 423, 431, 459 

<223> n = A,T,C or G 



<400> 148

cccgggcagg tctgtgttgn tttncaaccg gtgtcctccc cagcgtccag aananggaaa 60
tgtggagcgg gtgatgatga cccctcgctg tcctgtcacc tcctgcacag cttcgtatgt 120
gggtctggtc tgggaccacc cgtacaggtt gtgcacgttg tagtgctcca cgggggagct 180
gtccggcagg atctgctgac tctccatgca cagagtcttg ctgctcaggc ccttgtccct 240
agattccaaa tatggcatat agggtggggt tatttagcat ttcattgctg cagcccctga 300
cagatccatc cacaaaattt gatggctcat tcatatcaat ccacaatcca tcaaacttca 360
agctcttctc tggntctcga nggtttgcat agaactcttc tatctctttc ttccaccacg 420
canacctcgg ncgcgaccac gctaagccga attctgcana tatccatcac actggcggcc 480
gct 483



<210> 149 
<211> 439 
<212> DNA 
<213> Homo sapiens



<220> 

<221> misc_feature
<222> 11, 359, 384, 402
<223> n = A,T,C or G



<400> 149

ctttcacgaa nacaatgaat gcaagaattt gaggatctcc ttactcctcc cttttacaga 60
tggtctctca atcccttctt cttcctcttc atcttcatct tcttctgaac gcgctgccgg 120
gtaccacggc tttctttgtc tttatcgtga gatgaaggtg atgcttctgt ttcttctacc 180
ataactgaag aaatttcgct gcaagtctct tgactggctg tttctccgac ttcgcctttt 240
tgcaaacgtg agtcttttta cctcatgccc ctcagcttcc acagcatctt catctggatg 300
ttcatttctc aaagggctca ctgaggaaac ttctgactca catgtcgaag aagcactgng 360
agtttctctt catttgctgc aaanttgctc tttgctggct gngctctcag accacccatt 420
tggctgcatg ggggctgac 439

<210> 150
<211> 578
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature

<222> 4, 15, 260, 336, 371, 430, 461, 535, 572

<223> n = A,T,C or G

<400> 150

ggcncgcccg ggcangtcca ctccactttt gagctctgag ggaatacctt caggagggac 60
agggtcaggg agtcctggca gctccgcagc agagattcac attcattcag agacttgttg 120
tccagtgcaa tgccattgat cgcaacgatc ctgtctccca cagcaaggga cccttcttta 180
gcggcagggc ttccaggcag cacagcggca gcatacactc cattctccag actgatgcca 240
ctgtctttct gtccactgan gttgatgtgc agcggcgtga ccaccttccc acccagggac 300
ttcctccgcc gcacgaccat gttgatgggc cccctnccca ttgaggagcg ccttgatggc 360
ctgcttcttg nccttggtga tgaagtccac atcggtgatt ctcacagcca gtcattgacc 420
cttaagcggn catcagcaat gcttcctttg gccactttag ngacaaatat gccacagtcc 480
ccgggaaaca agggtcattc acaccttctg gcatatcaaa cacctcggcc gggancacta 540
agccgaattc tgcagatatc catcacactg gngggccg 578

<210> 151
<211> 503
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature 
<222> 392, 464 
<223> n = A,T,C or G 



<400> 151

cgagcggccc gcccgggcag gtctgggaga tcagcgactg ctgccacgtg cccagaaatg 60
gctcgtcctt tcactacagc ggaatgcaat gagggtgggt gagaagatga tgggtcggtt 120
atttcattcc ttttcttttt acaacttcac tttcagagac ttcagcgttc catgtctgct 180
gtgctgtgga acccagagtg ctcttgcctg gatggctgag aatcccttgg accctggaag 240
cacctactcc atgatggccc ggtatagtgc aggctcaata taatcttccc ggtatcttga 300
gttgataact cgttgccgtt tcttttcttg cttaacctct ttctctgtga aaatctcatt 360
gaagcgcatg tctgaagcta ctgacagtct anatttgact ctcttgggaa gctcttcatc 420
cagtgtgtat acatcatctc tcttaaccac aagttggagc catncttaaa cttcacctgg 480
tacatttgga tagggtggga ggc 503

<210> 152 
<211> 553 
<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature

<222> 293, 432, 459, 481, 536 

<223> n = A,T,C or G 



<400> 152

agcgtggtcg cggcccgagg tccactgagc tccgccttcc ccgggctccc tgaggaagca 60
gagtcctgac ttccaggaag gacaggacac agaggcaaga actcagcctg tgaggctctg 120
ggtggctcct gaggccagag gacgccttcc gcgatccatg gctcagcatc gtccttctgg 180
cttcccagcc ccgggccgaa cgttcgggtt aataagcaga gcagttattc ggctcctggc 240
aggagctccc ccgttagttt ccacgttgtg agcacattca tacttaagac tgnttctctt 300
tgtgttttaa gcgtctgtct ctgtagtaaa ctgaaatgtt aacagaaatg cagacctgcc 360
cgggcggccg ctcgaaagcc gaattctgca gatatccatc acactggcgg ccgctcgagc 420
atgcatctag anggcccaat tcgccctata gtgagtcgna ttacaattca ctgggccgcg 480
ntttacaacg tcgtgactgg gaaaaccctg cggtacccac ttaatcgcct tgcagnacat 540
ccccctttcg cca 553



<210> 153 
<211> 454 
<212> DNA
<213> Homo sapiens 



<220> 

<221> misc_feature

<222> 198, 307, 325, 347, 386, 389, 392, 415, 425 
<223> n = A,T,C or G 



<400> 153

tcgagcggct cgcccgggca ggtccaccta gcatggctcc tctaaacacg caactcagcg 60
aggggacccc cttcacctct ggcaagagag ctgggtagat cagaaacttg gtgacacctg 120
gctagcacag agcaggctca cttgtcttgg tcccactacc cagattcctg cagacattgc 180
aaaccaaatg aaggttgntg aatgacccct gtccccagcc acttgttttg gtatcatctg 240
ctctgcagtg gaatgcctgt gtgtttgagt tcactctgca tctgtatatt tgagtataga 300
aaccgantca agtgatctgt gcatncagac acactggggc acctgancac agaacaaatc 360
accttaacga tctggaatga aactgnganc antgcccgcc tgggtgggtc tgganaaact 420
gccgncttct tgttggacct tggccgcacc acct 454

<210> 154
<211> 596
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature

<222> 19, 33, 37, 131, 377, 425, 439, 505 
<223> n = A,T,C or G 



<400> 154

agcgtggtcg cggcccgang gcggcctcct gantganggg aagggacgtg ggggcggcca 60
cggcaggatt aacctccatt tcagctaatc atgggagaga ttaaagtctc tcctgattat 120
aactggttta naggtacagt tccccttaaa aagattattg tggatgatga tgacagtaag 180
atatggtcgc tctatgacgc gggcccccga agtatcaggt gtcctctcat attcctgccc 240
cctgtcagtg gaactgcaga tgtctttttc cggcagattt tggctctgac tggatggggt 300 
taccgggtta tcgctttgca gtatccagtt tattgggacc atctcgagtt cttgtgatgg 360 
attcacaaaa cttttanacc atttacaatt ggataaagtt catctttttg gcgcttcttt 420 
gggangcttt ttggcccana aatttgctga atacactcac aaatctccta gaagccattc 480 
cctaatcctc tgcaattcct tcagngacac ctctatcttc aaccaacttg gactggaaac 540 
agctttggct gatgcctgca tttatgctca aaaaatagtt cttggaaatt ttcatc 596 

<210> 155 

<211> 343 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 6, 12, 23, 44, 58, 86, 99, 279, 310, 319 
<223> n = A,T,C or G 

<400> 155

ctcganttgg cncgcccggg cangtctgcc tggtttttga ccgngcgagc tatttagnct 60
ctggctctgt ttccggagct caaggnaaaa atcttgaana actcgagcag cttctgtgga 120
tagccttggg tacacatact gccgagcata gccaatgtac tttctcaata gctggtgggg 180
aatgggatct attgtttctc caggaaccac ctttagtctt tctgataatg gcttctcaga 240
aactacttca agtacggaag tatttgaatc ttgactatnc atacgagcta ctgtggcact 300
gctaatgggn tctctgctnt ccagctctta ttgcaatcac atg 343

<210> 156
<211> 556
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> 34, 375, 530
<223> n = A,T,C or G

<400> 156

tcgagcggcc cgcccgggca ggtctggcac cacncagatc gattaactgg ctcatctgat 60
ctcgtggccc ccaccctgga actgacttag cacaaaagga cacctcaatt ccttatgatt 120
tcatctccga cccaaccaat caacaccctt gactcactgg ccttccccct cccaccaaat 180
tatccttaaa aactctgatc cccgaatgct cagggagatc gatttgagta ctaataagac 240
tccagtctcc tgcacaagca gctctgtgta ctcttcctct attgcaattc ctgtcttgat 300
aaatcggctc tgtgtaggcg gcggaagaag tgaacctgtt gggcggttac cacctctgtc 360
gtgtgtgaca gttgntttga atctctaatt gctcagtaca gatccacatg caggttaagt 420
aagaagcttt tgaagaaaat ggaaagtctt aagtgatggc ttccaagaaa tcaaacctac 480
attaattagg gaacaacgga ctttacgtat cacaaatgaa gagactgacn aagtaaatca 540
acttggcctt ttctta 556

<210> 157
<211> 333
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature

<222> 18, 40, 55, 57, 60, 91, 97, 103, 110, 161, 173, 193, 195,
196, 214, 231, 233, 238, 263, 264, 266, 283, 284, 287, 297,
298, 323, 331






<223> n = A,T,C or G

<400> 157
ggtccacaaa aatatatnaa ataagctgga tatataaaan caaacactta acatngncan 60
cattccttca gttattcaaa ctcactgata nctaacnggg agnagttggn attctggaag 120
acttcctaag ctaaaagtat atttacatat ttacaacaca ngtaaatata acngaagaac 180
tacttcaaat aangnngaaa ttccagaatt ctanagattt atagctatag ntnacaanta 240
tcaccaattg gtttgcaatc aanngnccag cactacttat gannaangtt taactannaa 300
accaaaaggg gagaaaacct ggnagggaaa nat 333

<210> 158
<211> 629
<212> DNA
<213> Homo sapiens



<220>
<221> misc_feature
<222> 345, 565
<223> n = A,T,C or G

<400> 158
tcgagcggcc gcccgggcag gtctggtaca tttgtgcgag gtccggcact ctgttctcat 60
ccagtaagtg gtcgagccct ttctgcagaa ttgctgttaa atgttctcct aatagctgtt 120
tctccacaca agcaatcagt ggtttctgtg tgctgtggtc caagtaagtg attactctgt 180
ctccctcttc ttctaagcgt ttacttacat ggttaagata ttctggaacc tctctttcct 240
gcattaacct ttggccttcg gcagcatata agcaattagt ctcttccaaa aatttcagtt 300
caaatgaatc tttatacacc tgcaggtcag acagcatgcc caggnaggct ccgcaacagg 360
ctccggtcca cggcctcgcc gctcctctcg cgctcgatca gcagtaggat tccatcaatg 420
gttttactct gaaccatttt atcactaata atatgggttc taaacagttc taatcccata 480
tcccagatgg agggcagcgt ggagttctgc agcacatagg tgcggtccaa gaacaggaag 540
atgcttctga tcatgaatca tttgnctggc aatggtcctg ccagcacgtg gtaatctttc 600
ttttaaaaat aaacccttat ctaaacgtc 629

<210> 159
<211> 629
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 33, 546, 576
<223> n = A,T,C or G



<400> 159
tcgagcggcc gcccgggcag gttctagagg ganaatctgg ctgatttggg aataaaatat 60
aatcgaatat tcaacaccat gaagataaat cttattttgg aaatctactg accttaatac 120
cccaagcttg ccctgaatac tttgattgga attggaatat atcaaaaaag gttagtattt 180
ttgttgtagt taggatacta aaaggatatt agttacccaa gagatccaat ttgtttttct 240
gatgaatagt gttcagtaaa atgaagcagt cttaagagtg actaataatt tcaaagtgat 300
ttttcgtcta ttcttaatat tttttaatta tttattttta agagttttat accttgagca 360
gatacaatga tccgctttag tgagaggaca atttctgatt gattgttttc tcttcaggcc 420
atctcacctc ttcattctct tgttacattt gaagcagttg atataatggg tttatacttt 480
aaaagataga catggtgcca tgaagtttgg ggaagttggg tgaattatcc cattctagtt 540
acagangagc tttccttaaa tgccctttac ttctangttt ggtcaagaag tcattttctg 600
agtaaaagtt attttcatat atgttgggg 629



<210> 160
<211> 519
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 46, 309, 397, 430, 434, 471, 497
<223> n = A,T,C or G



<400> 160 

tcgagcggcg cgcccgggca ggtctgctgg gattaatgcc aagttnttca gccataaggt 60 

agcgaaatct agcagaatcc agattacatc cacttccaat cacgcggtgt ttgggtaatc 120 

cacttagttt ccagataaca tacgtaagaa tgtccactgg gttggaaacc acaattatga 180 

tgcaatcagg actgtacttg acgatctgag gaataatgaa tttgaagaca ttaacatttc 240 

tctgcaccag attgagccga ctctcccctt cttgctgacg gactcctgca gttaccacta 300 

caatcttana attgggcggg tcacagaata atctttatct gccacaattt taggtgctga 360 

agaaataagc tcccatgctg cagatccatc atttctnctt taagcttatc ttccaaaaca 420 

tccacaagan caangttcat cagccagaga ctttcccaga atgctgatag nacacgccat 480 

accaacttgt ccaacancca ctacagcgat cttattggt 519 

<210> 161 
<211> 446 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature

<222> 5, 32, 36, 269, 354, 381 

<223> n = A,T,C or G 



<400> 161 

cgagnggccc gcccgggcag gtccagtaag cntttnacga tgatgggaaa ggttatgcaa 60 

ggtcccagcg gtacaacgag ctgtttctac atcatttgta ttctgcatgg tacgtacaat 120 

agcagacacc atctgaggag aacgcatgat agcgtgtctg gaagcttcct ttttagaaag 180 

ctgatggacc ataactgcag ccttattaac caccacctgg tcctcgtcat ttagcagttt 240 

tgtcagttca gggattgcac gtgtggcang ttctgcatca tcttgatagt taatcaagtt 300 

tacaactggc atgtttcagc atctgcgatg ggctcagcaa acgctggaca ttantgggat 360 

gagcagcatc aaactgtgta natgggatct gcatgccctc atctaatgtc tcagggaaca 420 

tagcagctcg taccctctga gctcga 446 

<210> 162 
<211> 354 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 6, 19, 36, 116, 152, 174, 186, 196, 223, 249 
<223> n = A,T,C or G 



<400> 162 

agcgtngtcg cggcccgang tcctgggaag cctttnttgc tgagcctcac agcctctgtc 60
aggcggctgc ggatccagcg gtccaccagg ctctcatggc ctccgggctg ggaggngggt 120
gagggcacaa aacccttccc aaggccacga anggcaaact tggtggcatt ccanagcttg 180
ttgcanaagt ggcggnaacc cagtatccgg ttcacatcca ggntgatgtc acgaccctgg 240
gacatgtang cacataatcc aaaccggaga gcatcggtgc cacattcacg aatccccgct 300
gggaagtcag ctttctgccc ttctttggcc ttctccacct cgctgggatc cagg 354



<210> 163 
<211> 258
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 7, 24, 32, 153, 198, 205
<223> n = A,T,C or G



<400> 163 

tttttcncca agtcctcttg ccgngggatc tngactgcaa tttaagacac ttctaattag 60
ttatacccag gccctgcaaa attgctgggt ttatataata tattcttgct gcacgaagat 120
ttattattct gttggatgat tctattttaa ttntatttat tctggccaaa aaagaacctt 180
ctccgctcgt caagagangc caatntgtct tgaaggacaa gagaaagatg ctaacacaca 240
ctttcttctt cttgagga 258

<210> 164
<211> 282
<212> DNA
<213> Homo sapiens



<220>
<221> misc_feature
<222> 97, 130, 163, 178, 203, 204
<223> n = A,T,C or G

<400> 164
ggaacatatt acttttaaat tacttgggtc aatgaaacat ttaataaaaa catttgcttc 60
tctatataat acgtatgtat aaaataagcc ttttcanaaa ctctggttct cataatcctc 120
tataaatcan atgatctgac ttctaagagg aacaaattac agnaaggggt atacattnat 180
gaatactggt agtactagag ganngacgct aaaccactct actaccactt gcggaactct 240
cacagggtaa atgacaaagc caatgactga ctctaaaaac aa 282



<210> 165 
<211> 462 
<212> DNA 
<213> Homo sapiens 



<220> 



<221> misc_feature
<222> 10, 33, 36, 49, 198, 222, 243, 278, 357, 385, 399, 405, 437
<223> n = A,T,C or G

<400> 165

gcccgggcan gtcctgtaat cccagctact cangangctg agtcatgana atcgcctgaa 60 
tccgggaggt agaggccgca gcgagcaaag attaagccac tgcactccag tctgggtgac 120 
agagtgagaa tctgtctgtt gctcctctgg cattggtctg aaatgggttt gtagaacatg 180 
ccacagaagg accagcanca gcaacaaatg gatttgtgga angcgtagct ccaaatggag 240
cangcacact tgatgaagca cgctgtgtct gtgcagangc aaccactggc actgttccaa 300 
aaacattgct gctagcatta cttgtggaag tatacgcatt actggaggtg gctgcanaac 360 
tgaaaacgct gtctagttct gccanagctg catacttgnc tgaanatgca cttgactgac 420 
tgggaactga accacanaac caacaggacc tttacctgtg ga 462



<210> 166 
<211> 365 
<212> DNA 
<213> Homo sapiens 
<220>
<221> misc_feature
<222> 14, 18
<223> n = A,T,C or G



<400> 166 

cgtgggtcgc ggcncgangt ctgaaaccaa tccagaacta aacatcagca cacaaaaaat 60
accaggatag atggaatcaa aagactctga agccaaaagg aggctaggga gagcaactga 120
acttagcaag ctgaggactt cagtgtccat catccgatcc tgccctgtaa caacaggtct 180
atatgataga gatattccat ctgagctgga ggccattatc cttagcaaac taacacagaa 240
cagaaaacca aatacatgtt ctcatttaga agtaggagct aaatgatgag aactcaagga 300
cacaaagaaa ggaacaacag acactggggc ctacttgagg gtggagggtg ggaggaggga 360
gaaga 365

<210> 167
<211> 364
<212> DNA
<213> Homo sapiens



<220>
<221> misc_feature
<222> 19, 342, 361
<223> n = A,T,C or G

<400> 167
agcgtggtcg cggcgcgang tccagcccta gcttgcctgt gactccgcct tcactgggtg 60
ctctctctaa aagttgctga ctctttactg tatctcccaa ttcccactcc attggttcca 120
taaggggagg ggtgtctcac tcaacatggt gttcctggta ccaagaactg gctgacgaag 180
ctgggtgccg tggctcatgc ctgtaatccc agcacttttg ggaggccaag aagggcggat 240
cacctgaggt ctggagttca agatcagcct gaccaacatg atgaaaccaa gtctccacta 300
aaaatataaa acaattagcc aggcatggtg gtgggtgcct gnaatcccag ctactgggga 360
ngct 364

<210> 168
<211> 447
<212> DNA
<213> Homo sapiens



<220>
<221> misc_feature
<222> 407, 414, 437
<223> n = A,T,C or G

<400> 168
cccgggcagg tcaaaaccca aaacctttca ttttagccca aaccagctca tgattaggta 60
tacaaggata acagaaccag ttgtcaggac gagcatttga caagtaaaag caattcttgc 120
aaagctgcag ttcatccagc tcatggcatg tgtctttata tagcatcctc gcaatgtcag 180
cttgctcact gtctgctcca tagaaaatca cggtattgtg gagaagcaat tgggcatcag 240
ctttgaactc ttcataactt cggtatttcc cttcattcac tttctcttga atggtgggaa 300
cgtccacaga cctcggccgc gaccacgcta agcccgaatt ctgcagatat ccatcacact 360
ggcggccgtt cgagcatggc atctagaagg cccaattcgc ctatagngag tcgnattacc 420
aattcactgg ccgtcgnttt acaacgc 447

<210> 169
<211> 524
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> 4, 6, 39, 40, 235, 248, 313, 340, 359, 382, 389, 420, 434,
442, 453, 496
<223> n = A,T,C or G



<400> 169 

cgantngcgc gcccgggcag gtctgagcag cctttctgnn tgctggacta ttgggattgg 60 
gttcatccaa cagagactgt atggatgtta gaatggaaga cacatcatag gttggactcc 120 
aacggttctg aagtatgtcc agacatatac taccatctgc atagactaag aacaaagaag 180 
taggtacatt aaacgtaaca agaccactaa ggttttaaca ttatagacaa aacanaaata 240 
gtcaaganta ctttgctttt gaagtttaaa gattcctatg ttgcttccca gttaactgcc 300 
taaaaagata agncataacc accactagtg aaataatcan gatgatcaga gaatgtcana 360 
tgtgatcagt ataaaactgg angatattna gtgtcatcct ttggaaaagg ctgccctatn 420 
atccaggaaa tcanaaacat tnttgaacag ggnccctagc tatccacaga catgtgggaa 480 
attcattccc caaatngtag gctggatccc ctatctgaaa taac 524 

<210> 170 
<211> 332 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<222> 5, 10, 63, 66, 90, 93, 96, 186, 207, 261, 290, 324, 326 
<223> n = A,T,C or G 



<400> 170 

tcgancggcn cgcccgggca ggtgacaaac ctgttattga agatgttggt tctgatgagg 60
aanaanatca gaagggatgg tgacaagaan aanaanaaga agattaagga aaagtacatc 120
gatcaagaag agctcaacaa aacaaagccc atctggacca gaaatcccga cgatattact 180
aatgangagt acggagaatt ctataanagc ttgaccaatg actgggaaga tcacttggca 240
gtgaagcatt tttcagttga nggacagttg gaattcagag cccttctatn tgtcccacga 300
cgtgctcctt ttgatctgtt tganancaga aa 332

<210> 171
<211> 334
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 5, 9, 200, 228, 232
<223> n = A,T,C or G



<400> 171
cgagnggcnc gcccgggcag gtctgttgat agcgacttaa cagaaaagtc tagacaaaca 60
taagcataaa aaattacagt ctttctaccc ttgggaatgg ggagaaaaag gaatctctac 120
cccaagacca gaaataataa gtcctgtttc tggtcctgaa catccagaat tatggaggct 180
ttggcctgac accacattan aatttggtct ggaaatcaaa ctttaganac angagatcgt 240
aagccatttt atactatcga cctaaattcc agtctaacgg ttcctttaca aagttgcgga 300
aagccctctt atatgctagc tgtaggaaat atag 334

<210> 172
<211> 439
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> 19, 375, 388, 390, 395, 409, 426, 434
<223> n = A,T,C or G

<400> 172
agcgtggtcg cggcccgang tctgcctata aaactagact tctgacgctg ggctccagct 60
tcattctcac aggtcatcat cctcatccgg gagagcagtt gtctgagcaa cctctaagtc 120
gtgctcatac tgtgctgcca aagctgggtc catgacaact tctggtgggg cgagagcagg 180
catggcaaca aattccaagt tagggtctcc aatgagcttc ctagcaagcc agaggaaggg 240
cttttcaaag ttgtagttac ttttggcaga aatgtcgtag tactgaagat tcttctttcg 300
gtggaagaca atggatttcg ccttcacttt ctgccttaat atccactttg gtgccacaca 360
acacaatggg gatgntttca cacacttngn accanatctc tatgccagnt aggccatttt 420
ggaagnactt cganggtac 439

<210> 173
<211> 599
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature 

<222> 5, 31 

<223> n = A,T,C or G 



<400> 173 

cgatnggccg cccgggcagg tcctgtaaaa naggaaattc agacatcgta cgactcgtaa 60
ttgaatgtgg agctgactgc aatattttgt caaagcacca gaatagtgcc ctgcactttg 120
cgaagcagtc taacaatgtg cttgtgtacg acttgctgaa gaaccattta gagacacttt 180
caagagtagc agaagagaca ataaaggatt actttgaagc tcgccttgct ctgctagaac 240
cagtttttcc aatcgcatgt catcgactct gtgagggtcc agatttttca acagatttca 300
attaccaacc cccacagaac ataccagaag gctctggcat cctgctgttt atcttccatg 360
caaacttttt gggtaaagaa gttattgctc ggctctgtgg accgtgtagt gtacaagctg 420
tagttctgaa tgataaattt cagcttcctg tttttctggg tctcgctctg ttgtccaggc 480
tggagtgcag tggcgcggat tacagctcac tggagtcttg acttcccagg cacaagcaat 540
cctcccacct cagcctccta actacctggg actaaaaatg caccgccacc acattccgg 599



<210> 174 
<211> 458 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature
<222> 30, 32, 36, 51, 61, 213, 261, 327, 347, 359, 377, 418

<223> n = A,T,C or G 



<400> 174 

tcgatttggc cgcccgggca ggtccatgcn gnttntgccc attcccatgg ngcccgacaa 60
ncccatcccc gaggccgaca tccccatgtt catgttcatg cccaccatgc cctggctcat 120
ccctgcgctg ttccccagag gggccattcc catggtgccc gtcattacac cgggcatgtt 180
cataggcatg ggtcccccca ggagagggtt agnttgaggc cggacaggaa gcatgtttga 240
tggagaactg aggttcacag nctccaaaac tttgagtcat cacattcata ggctgctgca 300
tattctgtct gctgaatcca ttgtatncag tgatggcctg ctggggnttt ggaaggctng 360
cataccaggt agtaagntcg tctaggctga tgtttacacc tggggtcaga ccaagtanga 420
gggcaaggtt ttgctgactg attttctgga cccatatc 458



<210> 175 
<211> 1206
<212> DNA
<213> Homo sapiens

<400> 175
ggcacgagga agttttgtgt actgaaaaag aaactgtcag aagcaaaaga aataaaatca 60
cagttagaga accaaaaagt taaatgggaa caagagctct gcagtgtgag gtttctcaca 120
ctcatgaaaa tgaaaattat ctcttacatg aaaattgcat gttgaaaaag gaaattgcca 180
tgctaaaact ggaaatagcc acactgaaac accaatacca ggaaaaggaa aataaatact 240
ttgaggacat taagatttta aaagaaaaga atgctgaact tcagatgacc ctaaaactga 300
aagaggaatc attaactaaa agggcatctc aatatagtgg gcagcttaaa gttctgatag 360
ctgagaacac aatgctcact tctaaattga aggaaaaaca agacaaagaa atactagagg 420
cagaaattga atcacaccat cctagactgg cttctgctgt acaagaccat gatcaaattg 480
tgacatcaag aaaaagtcaa gaacctgctt tccacattgc aggagatgct tgtttgcaaa 540
gaaaaatgaa tgttgatgtg agtagtacga tatataacaa tgaggtgctc catcaaccac 600
tttctgaagc tcaaaggaaa tccaaaagcc taaaaattaa tctcaattat gccggagatg 660
ctctaagaga aaatacattg gtttcagaac atgcacaaag agaccaacgt gaaacacagt 720
gtcaaatgaa ggaagctgaa cacatgtatc aaaacgaaca agataatgtg aacaaacaca 780
ctgaacagca ggagtctcta gatcagaaat tatttcaact acaaagcaaa aatatgtggc 840
ttcaacagca attagttcat gcacataaga aagctgacaa caaaagcaag ataacaattg 900
atattcattt tcttgagagg aaaatgcaac atcatctcct aaaagagaaa aatgaggaga 960
tatttaatta caataaccat ttaaaaaacc gtatatatca atatgaaaaa gagaaagcag 1020
aaacagaagt tatataatag tataacactg ccaaggagcg gattatctca tcttcatcct 1080
gtaattccag tgtttgtcac gtggttgttg aataaatgaa taaagaatga gaaaaccaga 1140
agctctgata cataatcata atgataatta tttcaatgca caactacggg tggtgctgct 1200
cgtgcc 1206

<210> 176
<211> 317
<212> PRT
<213> Homo sapiens

<400> 176
Met Gly Thr Arg Ala Leu Gln Cys Glu Val Ser His Thr His Glu Asn
1               5                   10                  15
Glu Asn Tyr Leu Leu His Glu Asn Cys Met Leu Lys Lys Glu Ile Ala
            20                  25                  30
Met Leu Lys Leu Glu Ile Ala Thr Leu Lys His Gln Tyr Gln Glu Lys
        35                  40                  45
Glu Asn Lys Tyr Phe Glu Asp Ile Lys Ile Leu Lys Glu Lys Asn Ala
    50                  55                  60
Glu Leu Gln Met Thr Leu Lys Leu Lys Glu Glu Ser Leu Thr Lys Arg
65                  70                  75                  80
Ala Ser Gln Tyr Ser Gly Gln Leu Lys Val Leu Ile Ala Glu Asn Thr
                85                  90                  95
Met Leu Thr Ser Lys Leu Lys Glu Lys Gln Asp Lys Glu Ile Leu Glu
            100                 105                 110
Ala Glu Ile Glu Ser His His Pro Arg Leu Ala Ser Ala Val Gln Asp
        115                 120                 125
His Asp Gln Ile Val Thr Ser Arg Lys Ser Gln Glu Pro Ala Phe His
    130                 135                 140
Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn Val Asp Val Ser
145                 150                 155                 160
Ser Thr Ile Tyr Asn Asn Glu Val Leu His Gln Pro Leu Ser Glu Ala
                165                 170                 175
Gln Arg Lys Ser Lys Ser Leu Lys Ile Asn Leu Asn Tyr Ala Gly Asp
            180                 185                 190
Ala Leu Arg Glu Asn Thr Leu Val Ser Glu His Ala Gln Arg Asp Gln
        195                 200                 205
Arg Glu Thr Gln Cys Gln Met Lys Glu Ala Glu His Met Tyr Gln Asn
    210                 215                 220
Glu Gln Asp Asn Val Asn Lys His Thr Glu Gln Gln Glu Ser Leu Asp
225                 230                 235                 240
Gln Lys Leu Phe Gln Leu Gln Ser Lys Asn Met Trp Leu Gln Gln Gln
                245                 250                 255
Leu Val His Ala His Lys Lys Ala Asp Asn Lys Ser Lys Ile Thr Ile
            260                 265                 270
Asp Ile His Phe Leu Glu Arg Lys Met Gln His His Leu Leu Lys Glu
        275                 280                 285
Lys Asn Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile
    290                 295                 300
Tyr Gln Tyr Glu Lys Glu Lys Ala Glu Thr Glu Val Ile
305                 310                 315













<210> 177 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Made in the lab 



<400> 177 

ccaatcatct ccacaggagc 20 

<210> 178 

<211> 1665 

<212> DNA 

<213> Homo sapiens 



<400> 178 

gcaaactttc aagcagagcc tcccgagaag ccatctgcct tcgagcctgc cattgaaatg 60
caaaagtctg ttccaaataa agccttggaa ttgaagaatg aacaaacatt gagagcagat 120
cagatgttcc cttcagaatc aaaacaaaag aaggttgaag aaaattcttg ggattctgag 180
agtctccgtg agactgtttc acagaaggat gtgtgtgtac ccaaggctac acatcaaaaa 240
gaaatggata aaataagtgg aaaattagaa gattcaacta gcctatcaaa aatcttggat 300
acagttcatt cttgtgaaag agcaagggaa cttcaaaaag atcactgtga acaacgtaca 360
ggaaaaatgg aacaaatgaa aaagaagttt tgtgtactga aaaagaaact gtcagaagca 420
aaagaaataa aatcacagtt agagaaccaa aaagttaaat gggaacaaga gctctgcagt 480
gtgaggtttc tcacactcat gaaaatgaaa attatctctt acatgaaaat tgcatgttga 540
aaaaggaaat tgccatgcta aaactggaaa tagccacact gaaacaccaa taccaggaaa 600
aggaaaataa atactttgag gacattaaga ttttaaaaga aaagaatgct gaacttcaga 660
tgaccctaaa actgaaagag gaatcattaa ctaaaagggc atctcaatat agtgggcagc 720
ttaaagttct gatagctgag aacacaatgc tcacttctaa attgaaggaa aaacaagaca 780
aagaaatact agaggcagaa attgaatcac accatcctag actggcttct gctgtacaag 840
accatgatca aattgtgaca tcaagaaaaa gtcaagaacc tgctttccac attgcaggag 900
atgcttgttt gcaaagaaaa atgaatgttg atgtgagtag tacgatatat aacaatgagg 960
tgctccatca accactttct gaagctcaaa ggaaatccaa aagcctaaaa attaatctca 1020
attatgccgg agatgctcta agagaaaata cattggtttc agaacatgca caaagagacc 1080
aacgtgaaac acagtgtcaa atgaaggaag ctgaacacat gtatcaaaac gaacaagata 1140
atgtgaacaa acacactgaa cagcaggagt ctctagatca gaaattattt caactacaaa 1200
gcaaaaatat gtggcttcaa cagcaattag ttcatgcaca taagaaagct gacaacaaaa 1260
gcaagataac aattgatatt cattttcttg agaggaaaat gcaacatcat ctcctaaaag 1320
agaaaaatga ggagatattt aattacaata accatttaaa aaaccgtata tatcaatatg 1380
aaaaagagaa agcagaaaca gaaaactcat gagagacaag cagtaagaaa cttcttttgg 1440
agaaacaaca gaccagatct ttactcacaa ctcatgctag gaggccagtc ctagcattac 1500
cttatgttga aaatcttacc aatagtctgt gtcaacagaa tacttatttt agaagaaaaa 1560
ttcatgattt cttcctgaag cctgggcgac agagcgagac tctgtctcaa aaaaaaaaaa 1620
aaaaaaagaa agaaagaaat gcctgtgctt acttcgcttc ccagg 1665

<210> 179 
<211> 179 
<212> PRT 
<213> Homo sapiens 



<400> 179
Ala Asn Phe Gln Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro
1               5                   10                  15
Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys
            20                  25                  30
Asn Glu Gln Thr Leu Arg Ala Asp Gln Met Phe Pro Ser Glu Ser Lys
        35                  40                  45
Gln Lys Lys Val Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu Arg Glu
    50                  55                  60
Thr Val Ser Gln Lys Asp Val Cys Val Pro Lys Ala Thr His Gln Lys
65                  70                  75                  80
Glu Met Asp Lys Ile Ser Gly Lys Leu Glu Asp Ser Thr Ser Leu Ser
                85                  90                  95
Lys Ile Leu Asp Thr Val His Ser Cys Glu Arg Ala Arg Glu Leu Gln
            100                 105                 110
Lys Asp His Cys Glu Gln Arg Thr Gly Lys Met Glu Gln Met Lys Lys
        115                 120                 125
Lys Phe Cys Val Leu Lys Lys Lys Leu Ser Glu Ala Lys Glu Ile Lys
    130                 135                 140
Ser Gln Leu Glu Asn Gln Lys Val Lys Trp Glu Gln Glu Leu Cys Ser
145                 150                 155                 160
Val Arg Phe Leu Thr Leu Met Lys Met Lys Ile Ile Ser Tyr Met Lys
                165                 170                 175
Ile Ala Cys



<210> 180 

<211> 1681 

<212> DNA 

<213> Homo sapiens 

<400> 180 

gatacagtca ttcttgtgaa agagcaaggg aacttcaaaa agatcactgt gaacaacgta 60 

caggaaaaat ggaacaaatg aaaaagaagt tttgtgtact gaaaaagaaa ctgtcagaag 120 

caaaagaaat aaaatcacag ttagagaacc aaaaagttaa atgggaacaa gagctctgca 180 

gtgtgagatt gactttaaac caagaagaag agaagagaag aaatgccgat atattaaatg 240 

aaaaaattag ggaagaatta ggaagaatcg aagagcagca taggaaagag ttagaagtga 300 

aacaacaact tgaacaggct ctcagaatac aagatataga attgaagagt gtagaaagta 360 

atttgaatca ggtttctcac actcatgaaa atgaaaatta tctcttacat gaaaattgca 420 

tgttgaaaaa ggaaattgcc atgctaaaac tggaaatagc cacactgaaa caccaatacc 480 

aggaaaagga aaataaatac tttgaggaca ttaagatttt aaaagaaaag aatgctgaac 540 

ttcagatgac cctaaaactg aaagaggaat cattaactaa aagggcatct caatatagtg 600 

ggcagcttaa agttctgata gctgagaaca caatgctcac ttctaaattg aaggaaaaac 660 

aagacaaaga aatactagag gcagaaattg aatcacacca tcctagactg gcttctgctg 720 

tacaagacca tgatcaaatt gtgacatcaa gaaaaagtca agaacctgct ttccacattg 780 

caggagatgc ttgtttgcaa agaaaaatga atgttgatgt gagtagtacg atatataaca 840 

atgaggtgct ccatcaacca ctttctgaag ctcaaaggaa atccaaaagc ctaaaaatta 900 






atctcaatta tgccggagat gctctaagag aaaatacatt ggtttcagaa catgcacaaa 960 
gagaccaacg tgaaacacag tgtcaaatga aggaagctga acacatgtat caaaacgaac 1020 
aagataatgt gaacaaacac actgaacagc aggagtctct agatcagaaa ttatttcaac 1080 
tacaaagcaa aaatatgtgg cttcaacagc aattagttca tgcacataag aaagctgaca 1140 
acaaaagcaa gataacaatt gatattcatt ttcttgagag gaaaatgcaa catcatctcc 1200 
taaaagagaa aaatgaggag atatttaatt acaataacca tttaaaaaac cgtatatatc 1260 
aatatgaaaa agagaaagca gaaacagaaa actcatgaga gacaagcagt aagaaacttc 1320 
ttttggagaa acaacagacc agatctttac tcacaactca tgctaggagg ccagtcctag 1380 
cattacctta tgttgaaaaa tcttaccaat agtctgtgtc aacagaatac ttattttaga 1440 
agaaaaattc atgatttctt cctgaagcct acagacataa aataacagtg tgaagaatta 1500 
cttgttcacg aattgcataa aagctgccca ggatttccat ctaccctgga tgatgccgga 1560 
gacatcattc aatccaacca gaatctcgct ctgtcactca ggctggagtg cagtgggcgc 1620 
aatctcggct cactgcaact ctgcctccca ggttcacgcc attctctggc acagcctccc 1680 

g 1681



<210> 181 
<211> 432 
<212> PRT 
<213> Homo sapiens 

<400> 181 

Asp Thr Val His Ser Cys Glu Arg Ala Arg Glu Leu Gln Lys Asp His
1               5                   10                  15
Cys Glu Gln Arg Thr Gly Lys Met Glu Gln Met Lys Lys Lys Phe Cys
            20                  25                  30
Val Leu Lys Lys Lys Leu Ser Glu Ala Lys Glu Ile Lys Ser Gln Leu
        35                  40                  45
Glu Asn Gln Lys Val Lys Trp Glu Gln Glu Leu Cys Ser Val Arg Leu
    50                  55                  60
Thr Leu Asn Gln Glu Glu Glu Lys Arg Arg Asn Ala Asp Ile Leu Asn
65                  70                  75                  80
Glu Lys Ile Arg Glu Glu Leu Gly Arg Ile Glu Glu Gln His Arg Lys
                85                  90                  95
Glu Leu Glu Val Lys Gln Gln Leu Glu Gln Ala Leu Arg Ile Gln Asp
            100                 105                 110
Ile Glu Leu Lys Ser Val Glu Ser Asn Leu Asn Gln Val Ser His Thr
        115                 120                 125
His Glu Asn Glu Asn Tyr Leu Leu His Glu Asn Cys Met Leu Lys Lys
    130                 135                 140
Glu Ile Ala Met Leu Lys Leu Glu Ile Ala Thr Leu Lys His Gln Tyr
145                 150                 155                 160
Gln Glu Lys Glu Asn Lys Tyr Phe Glu Asp Ile Lys Ile Leu Lys Glu
                165                 170                 175
Lys Asn Ala Glu Leu Gln Met Thr Leu Lys Leu Lys Glu Glu Ser Leu
            180                 185                 190
Thr Lys Arg Ala Ser Gln Tyr Ser Gly Gln Leu Lys Val Leu Ile Ala
        195                 200                 205
Glu Asn Thr Met Leu Thr Ser Lys Leu Lys Glu Lys Gln Asp Lys Glu
    210                 215                 220
Ile Leu Glu Ala Glu Ile Glu Ser His His Pro Arg Leu Ala Ser Ala
225                 230                 235                 240
Val Gln Asp His Asp Gln Ile Val Thr Ser Arg Lys Ser Gln Glu Pro
                245                 250                 255
Ala Phe His Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn Val
            260                 265                 270
Asp Val Ser Ser Thr Ile Tyr Asn Asn Glu Val Leu His Gln Pro Leu
        275                 280                 285
Ser Glu Ala Gln Arg Lys Ser Lys Ser Leu Lys Ile Asn Leu Asn Tyr
    290                 295                 300
Ala Gly Asp Ala Leu Arg Glu Asn Thr Leu Val Ser Glu His Ala Gln
305                 310                 315                 320
Arg Asp Gln Arg Glu Thr Gln Cys Gln Met Lys Glu Ala Glu His Met
                325                 330                 335
Tyr Gln Asn Glu Gln Asp Asn Val Asn Lys His Thr Glu Gln Gln Glu
            340                 345                 350
Ser Leu Asp Gln Lys Leu Phe Gln Leu Gln Ser Lys Asn Met Trp Leu
        355                 360                 365
Gln Gln Gln Leu Val His Ala His Lys Lys Ala Asp Asn Lys Ser Lys
    370                 375                 380
Ile Thr Ile Asp Ile His Phe Leu Glu Arg Lys Met Gln His His Leu
385                 390                 395                 400
Leu Lys Glu Lys Asn Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys
                405                 410                 415
Asn Arg Ile Tyr Gln Tyr Glu Lys Glu Lys Ala Glu Thr Glu Asn Ser
            420                 425                 430







<210> 182 
<211> 511 
<212> DNA
<213> Homo sapiens 



<400> 182 

gaagtttcat gaggtttagc ttttctgggc tggggagtgg agagaaagaa gttgcagggc 60 
ttacaggaaa tcccagagcc tgaggttttc tcccagattt gagaactcta gattctgcat 120 
cattatcttt gagtctatat tctcttgggc tgtaagaaga tgaggaatgt aataggtctg 180 
ccccaagcct ttcatgcctt ctgtaccaag cttgtttcct tgtgcatcct tcccaggctc 240 
tggctgcccc ttattggaga atgtgatttc caagacaatc aatccacaag tgtctaagac 300 
tgaatacaaa gaacttcttc aagagttcat agacgacaat gccactacaa atgccataga 360 
tgaattgaag gaatgttttc ttaaccaaac ggatgaaact ctgagcaatg ttgaggtgtt 420 
tatgcaatta atatatgaca gcagtctttg tgatttattt taactttctg caagaccttt 480 
ggctcacaga actgcagggt atggtgagaa a 511 

<210> 183 
<211> 260 
<212> DNA 
<213> Homo sapiens 



<400> 183 

cacctcgcgg ttcagctcct ctgtcttggt gaagaaccat tcctcggcat ccttgcggtt 60
cttctctgcc atcttctcat actggtcacg catctcgttc agaatgcggc tcaggtccac 120
gccaggtgca gcgtccatct ccacattgac atctccaccc acctggcctc tcagggcatt 180
catctcctcc tcgtggttct tcttcaggta ggccagctcc tccttcaggc tctcaatctg 240
catctccagg tcagctctgg 260

<210> 184
<211> 461
<212> DNA
<213> Homo sapiens



<400> 184 

gtctgatggg agaccaaaga atttgcaagt ggatggtttg gtatcactgt aaataaaaag 60
agggcctttt ctagctgtat gactgttact tgaccttctt tgaaaagcat tcccaaaatg 120
ctctatttta gatagattaa cattaaccaa cataattttt tttagatcga gtcagcataa 180
atttctaagt cagcctctag tcgtggttca tctctttcac ctgcatttta tttggtgttt 240
gtctgaagaa aggaaagagg aaagcaaata cgaattgtac tatttgtacc aaatctttgg 300
gattcattgg caaataattt cagtgtggtg tattattaaa tagaaaaaaa aaattttgtt 360
tcctaggttg aaggtctaat tgataccgtt tgacttatga tgaccattta tgcactttca 420
aatgaatttg ctttcaaaat aaatgaagag cagacctcgg c 461



<210> 185 
<211> 531 
<212> DNA 
<213> Homo sapiens 



<400> 185 

tctgatttta tttccttctc aaaaaaagtt atttacagaa ggtatatatc aacaatctga 60
caggcagtga acttgacatg attagctggc atgatttttt cttttttttc ccccaaacat 120
tgtttttgtg gccttgaatt ttaagacaaa tattctacac ggcatattgc acaggatgga 180
tggcaaaaaa aagtttaaaa acaaaaaccc ttaacggaac tgccttaaaa aggcagacgt 240
cctagtgcct gtcatgttat attaaacata catacacaca atctttttgc ttattataat 300
acagacttaa atgtacaaag atgttttcca cttttttcaa tttttaaaca caacagctat 360
aaacctgaac acatatgcta tcatcatgcc ataagactaa aacaattata tttagcgaca 420
agtagaaagg attaaatagt caaatacaag aatgaaaaac gcagtacata gtgtcgcgaa 480
ctcaaatcgg catttagata gatccagtgg tttaaacggc acgtttttgc t 531

<210> 186
<211> 441
<212> DNA
<213> Homo sapiens



<400> 186 

cattcctttc ctcgcgttgg ggtttctctg tgtcagcgag cctcggtaca ctgatttccg 60
atcaaaagaa tcatcatctt taccttgact tttcagggaa ttactgaact ttcttctcag 120
aagatagggc acagccattg ccttggcctc acttgaaggg tctgcatttg ggtcctctgg 180
tctcttgcca agtttcccaa ccactcgagg gagaaatatc gggaggtttg acttcctccg 240
gggctttccc gagggcttca ccgtgagccc tgcggccctc agggctgcaa tcctggattc 300
aatgtctgaa acctcgctct ctgcctgctg gacttctgag gccgtcactg ccactctgtc 360
ctccagctct gacagctcct catctgtggt cctgttgtac tggacggggt ccccagggtc 420
ctgggggctt ttttcctgtc t 441

<210> 187
<211> 371
<212> DNA
<213> Homo sapiens



<400> 187 

aaaagtgaat gagtaactat tatattgttg gcaataataa gttgcaaaat catcaggctg 60
caggctgctg atggtgagag tgaactctgt cccagatcca ctgccgctga accttgatgg 120
gaccccagat tctaaactag acgccttatg gatcaggagc tttggggctt tccctggttt 180
ctgttgatac caggccaacc aactactaac actctgactg gcccggcaag tgatggtgac 240
tctgtctcct acagttgcag acagggtgga aggagactgg gtcatctgga tgtcacattt 300
ggcacctggg agccagagca gcaggagccc caggagctga gcggggaccc tcatgtccat 360
gctgagtcct g 371

<210> 188
<211> 226
<212> DNA
<213> Homo sapiens



<400> 188 

ggtatataaa ttgagatgcc cccccaggcc agcaaatgtt cctttttgtt caaagtctat 60
ttttattcct tgatattttt cttttttttt tttttgtgga tggggacttg tgaatttttc 120
taaaggtgct atttaacatg ggaggagagc gtgtgcggct ccagcccagc ccgctgctca 180
ctttccaccc tctctccacc tgcctctggc ttctcaggac ctgccc 226

<210> 189 
<211> 391 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<222> 43, 112, 131, 156, 195, 208, 221, 317, 333, 367 
<223> n = A,T,C or G 



<400> 189 

tgggtgaagt ttattctgtt ttcacatcta ggttgttggg ganagtgata gacaaagttc 60
tggattctgg gcatcgtcgg cgcatgcttg taatcctact tgggaggttg anacaggaga 120
cctcggccgc naccacgcta agggcgaatt ctgcanatat ccatcacact ggcggccgct 180
cgagcatgca tctanagggc ccaattcncc ctatagtgag ncgtattaca attcactggc 240
cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc 300
agcacatccc cctttcncca gctggcttaa tancgaagag gcccgcaccg atcgcccttc 360
ccaacanttg cgcagcctga atggcgaatg g 391

<210> 190
<211> 501
<212> DNA
<213> Homo sapiens



<400> 190 

catcttggcc tttttgagct gtttccgctt cttctcatcc cggtcactgt caccctcatt 60
actggaggag ctggcagagg cgttgctgtc aaactcctct gccacatctt cctcctcttc 120
acctgggttg aatgactcat cggtttcttc tcctgagtca tcgctgctgt cattggcatt 180
ctcctcccgg atcttgcctt cctccttcat cctctccaag taggcatcat gctggtcctc 240
atcagagtca gcatattcat cgtagcttgg gttcatgccc tctttcaatc ctcggttttt 300
gatgttgagc tttttcgcgt tgacaaaatc aaacagtttc ccgtactcct ccctctcaat 360
gctgctgaag gtatactgag tgccctgctt ggtctcaatt tcaaagtcaa aggaacgagt 420
agtagtggta ccacgagcaa agttgacaaa ggagatctca tcgaagcgga tgtgcacagg 480
tggcttgtgg acgtagatga a 501

<210> 191
<211> 241
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 49
<223> n = A,T,C or G



<400> 191 

ggaaaaactg tgaaaaatat atctgaattt attaagtaca gtataaaana gggttgtggc 60
aacagaaagt aaaaactaac atggattgct ataaatatgc tgaagcctag ttgttcaaat 120
gatacaattc tctcatgcta ctctaaagtt tataaagaaa aaggatttac actttacaca 180
ctgtacacaa aaggaatacc ttctgagagc cagggagtgg ggaaagggga aggagacttg 240
a 241

<210> 192
<211> 271
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> 6, 17, 23, 26, 70, 227, 245
<223> n = A,T,C or G

<400> 192
tggtcntgga ttcacanata aantanatcg actaaaactg gcagaaattg tgaagcaggt 60
gatagaagan caaaccacgt cccacgaatc ccaataatga cagcttcaga ctttgctttt 120
ttaacaattt gaaaaattat tctttaatgt ataaagtaat tttatgtaaa ttaataaatc 180
ataatttcat ttccacattg attaaagctg ctgtatagat ttagggngca ggacttaata 240
atagnggaaa tgaaattatg atttattaat



<210> 193 
<211> 351 
<212> DNA 
<213> Homo sapiens 



<400> 193 

agtcgaggcg ctgatcccta aaatggcgaa catgtgtttt catcatttca gccaaagtcc 60
taacttcctg tgcctttcct atcacctcga gaagtaatta tcagttggtt tggatttttg 120
gaccaccgtt cagtcatttt gggttgccgt gctcccaaaa cattttaaat gaaagtattg 180
gcattcaaaa agacagcaga caaaatgaaa gaaaatgaga gcagaaagta agcatttcca 240
gcctatctaa tttctttagt tttctatttg cctccagtgc agtccatttc ctaatgtata 300
ccagcctact gtactattta aaatgctcaa tttcagcacc gatggacctg c 351

<210> 194
<211> 311
<212> DNA
<213> Homo sapiens



<400> 194 

ctgagacaca gaggcccact gcgaggggga 
caccctccct ctgctgggat gaggtccagg 
ctctggtgtt cccaccaccc tagatgaaaa 
ttccatccct aaaccacttc cttaaaatgt 
ggagcctggg gttggatagg gccatggctg 
tgacacagac c 

<210> 195 
<211> 381 
<212> DNA 
<213> Homo sapiens 



cagtggcggt gggactgacc tgctgacagt 60 
agccaactaa aacaatggca gaggagacat 120 
tccacagcac agacctctac cgtgtttctc 180 
ttggatttgc aaagccaatt tggggcctgt 240 
gtcccccacc atacctcccc tccacatcac 300 

311 



<400> 195 

tgtcagagtg gcactggtag aagttccagg 
gccaacagga tgacatgaaa tgatgtactc 
ggttgtctga gagagagctt cttgtcctgt 
ctgattattc ttcagggcaa tgacataaat 
aatagtagcc tctgtgacac cagggcgggg 
aggcttctca tacttgatga tgtagccggt 
cagcagggaa ttgggtgtgg t 

<210> 196 

<211> 401 

<212> DNA 

<213> Homo sapiens 



aaccctgaac tgtaagggtt cttcatcagt 60 
agaagtgtcc tggaatgggg cccatgagat 120 
ctttttcctt ccaatcaggg gctcgctctt 180 
tgtatattcg gttcccggtt ccaggccagt 240 
ccgagggacc acttctctgg gaggagaccc 300 
aatcctggca cgtggcggct gccatgatac 360 

381 
<400> 196

cacaaacaag aggagcacca gacctcctct 
gcccaaacct ggagacctga ttgagatttt 
tataggagat ggctacgtga tccatctggc 
ctccagtgtc ttctcagtcc tgagcaacag 
tgtggtggga ggctgttgct atcgggtcaa 
gcccgtggag gtgatcacca gttctgcgaa 
tattgtgagc aggaactgtg agcactttgt 

<210> 197 
<211> 471 
<212> DNA 
<213> Homo sapiens 



tggcttcgag atggcttcgc cacaccaaga 60 
ccgccttggc tatgagcact gggccctgta 120 
tcctccaagt gagtaccccg gggctggctc 180 
tgcagaggtg aaacgggagc gcctggaaga 240 
caacagcttg gaccatgagt accaaccacg 300 
ggagatggtt ggtcagaaga tgaagtacag 360 
cacccagacc t 401 



<400> 197 

ctgtaatgat gtgagcaggg agccttcctc 

aactttgtac cttgattgcc ttacaaagtt 

cctcctgccc caaagcttgt gggcacatgg 

atatatgtac agacatgtac tctcacacac 

gtacagctcc caggaacagc taggtgggaa 

ccctgaacaa aaattgggca ctcatctatt 

ccaaactctg gaaaggaccc aatgtaccag 

ggaagagagc tgcttaaact cacacaacaa 

<210> 198 
<211> 201 
<212> DNA 
<213> Homo sapiens 



cctgggccac ctgcagagag ctttcccacc 60 
atttgtttac aaacagcgac catataaaag 120 
gcacatacag actcacatac agacacacac 180 
acaggcacca gcatacacac gtttttctag 240 
agtcccatca ctgagggagc ctaaccatgt 300 
ccttttctct tgtgtcccta ctcattgaaa 360 
tatttatacc tctagtgaag cacagagaga 420 
tgaactgcag acacagacct g 471 



<400> 198 

ggtccattga ggctctgtcg gccatgccca 
aagcccagaa gtttagggaa aagctgcaag 
tggcctatgc agtcaactcc tggaccacta 
tcctctacat tggtgggcag a 

<210> 199 
<211> 551 
<212> DNA 
<213> Homo sapiens 



cagttcgaag ctttgccaac gaggagggcg 60 
aaataaagac actcaaccag aaggaggctg 120 
gtatttcagg tatgctgctg aaagtgggaa 180 

201 



<400> 199 

tctggcacag atcttcaccc acacggcggt 

gggcctggaa cacaccatct tccccatgag 

ggcccctggc cttatgtccc agttatgacc 

actccagtcc atctctgaca tttttaacac 

acctcgattc ccatcttgag cccagtgtta 

tccaaccttg tgatcctaat tctgggacct 

agctcctgac cttagtcctg accgctaccc 

gggtggcccc tgaccttatt actgtcattt 

ctttatgacc tcctactctc aattttaact 
atgaccacaa t 

<210> 200 
<211> 211 
<212> DNA 
<213> Homo sapiens 



ccacgtgctg atcatcttcc gggtctcacc 60 
cccggtgccc agtctggtga cttccatctt 120 
cctgacttca actctggctc ttaccctgta 180 
ccggccttgt gaccgtggac atagctcctg 240 
gtccatgaga tcatgacctg actcctggtc 300 
caatcctagc ctctgaactt gggaccctgg 360 
ttgattctga cctttgatcc tgtaacttag 420 
agctccttga ccttgccact tcaatcctgg 480 
ttaaccaaat gaccaaattt gtgacactaa 540 

551 



<220> 
<221> misc_feature

<222> 8, 36, 40, 78, 165, 170, 171, 173, 203, 207, 208 
<223> n = A,T,C or G 



<400> 200 

cagctcancg ggcgacatgc ccctacaagt 
tgtaagagag gctgctgnca ccattacctg 
tactgctagg gggcacatag cgcccatggg 
ggtaggtatc ccgggctgga aanatgnnca 

<210> 201 
<211> 111 
<212> DNA
<213> Homo sapiens 



tggcanaagn ggctgccact gctgggtttg 60 

cagaaacctt ctcatagggg ctacgatcgg 120 

tgtggtaggt ggggnactcn ntnataggat 180 

g 211 



<400> 201 

ccagtgaaag gaaacaaaac tggcagtttg tccatttgaa tatcagacct agtttcttct 60 
taatttccac actatttctc ccatattcct taaacttctt ggcatccacc t 111 



<210> 202 
<211> 331 
<212> DNA 
<213> Homo sapiens 



<400> 202 

tgaaaataca gaataccagg tggtcccaaa tgtttgaagt tctttgaaca gaaagagaga 60
ggagagagag agagaggaaa attccctaac ccttggttta aagacaatat tcatttattg 120
ctcaaatgat gcttttaagg gaggacagtg gaataaaata aacttttttt ttctccctac 180
aatacataga agggttatca aaccactcaa gtttcaaaat ctttccaggg tccaatatca 240
ctttttttct ttcggttcaa tgaaaagcta aatgtaataa tactaattat agataaaatt 300
ttattttact ttttaaaaat ttgtccagac c 331



<210> 203 
<211> 491 
<212> DNA 
<213> Homo sapiens 



<400> 203 

agtcacccag tctacttagt acctggttgc 
ggctttagtg taaccaataa atctgtagtg 
gggaaggtag gaatgggcta agtatgatga 
atcacagaaa acctaattca aactggctta 
tagaaagtcc ataggtagtg ctggctccag 
ctctaaatac cggactgact tttttctcac 
tgggccactt aatggctgcc agcattccta 
cagaaaaaga aggcatcttt gtaccagaaa 
ttaggacctg c 

<210> 204 
<211> 361 
<212> DNA 
<213> Homo sapiens 



tgcctctgac cttttcagct tgataccctg 60 

accttacctg tattccctgt gctatcctgt 120 

atgtataggt tagggatctt ttggttttaa 180 

aaataaaaag gatttattgg ttcatgtaac 240 

gtgaagactt gacccagtag ttcagtatgt 300 

tgttgcatct tctgtaggac catttaagtc 360 

agattacact tttccccatt tatgtccaat 420 

tctcagcaaa agccctaata ttcacactga 480 

491 



<400> 204 

tcccttcctc ccccatgtga taaatgggtc cagggctgat caaagaactc tgactgcaga 60
actgccgctc tcagtggaca gggcatctgt tatcctgaga cctgtggcag acacgtcttg 120
ttttcatttg atttttgtta agagtgcagt attgcagagt ctagaggaat ttttgtttcc 180
ttgattaaca tgattttcct ggttgttaca tccagggcat ggcagtggcc tcagccttaa 240
acttttgttc ctactcccac cctcagcgaa ctgggcagca cggggagggt ttggctaccc 300
ctgcccatcc ctgagccagg taccaccatt gtaaggaaac actttcagaa attcagacct 360
c 361



<210> 205
<211> 471
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 2, 3
<223> n = A,T,C or G



<400> 205 

cnngtacagt tcttcctgga tggccgacac agatcctggg gaaaggcaat cctggcactg 60 
ctctgaaacc agagctcctc ctccctcccc gggcagggtg gagctgagaa gggctgctct 120 
agcgttggga ctccacctcc atacacctga tattttgata gggcaggtcc ctgctatggg 180 
ccactgttct gggcagtata gtatgcttga cagcatcctt ggcatctatc caccagatcc 240 
cagagcaccc gctactagct gtgacaacat cctccaaaca ttgcaaaatt tcccctggga 300 
ggcaagattg cctcagatgg gagaatcacg ctctagggaa atctgctggt atgagaaccc 360 
caactcccca ctccactgag cctccagatg gcgagcaggc tgcagctcca gcacagacac 420 
gaagctccct ccagccactg acggtccatg gctggggtta cccaggacct c 471 

<210> 206 
<211> 261 
<212> DNA 
<213> Homo sapiens 



<400> 206 

tagagtattt agagtcctga gataacaagg 
tgtcctttct tcccaatcag agatttgtgg 
gtagccttga tgagagaatc caattcttca 
ggggtaatac gctttacctt taagtctttt 
gcggtgaggt actccaggat g 

<210> 207 
<211> 361 
<212> DNA 
<213> Homo sapiens 



aatccaggca tcctttagac agtcttctgt 60 
atgtgtggaa tgacaccacc accagcaatt 120 
tctccacgaa tagcaagttg caagtgacga 180
gatgcatttc ctgccagttc aagtacctct 240 

261 



<400> 207 

gctctccggg agcttgaaga agaaactggc tacaaagggg acattgccga atgttctcca 60
gcggtctgta tggacccagg cttgtcaaac tgtactatac acatcgtgac agtcaccatt 120
aacggagatg atgccgaaaa cgcaaggccg aagccaaagc caggggatgg agagtttgtg 180
gaagtcattt ctttacccaa gaatgacctg ctgcagagac ttgatgctct ggtagctgaa 240
gaacatctca cagtggacgc cagggtctat tcctacgctc tagcactgaa acatgcaaat 300
gcaaagccat ttgaagtgcc cttcttgaaa ttttaagccc aaatatgaca ctggacctgc 360
c 361



<210> 208 
<211> 381 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 10, 27, 37, 46, 75, 95, 102, 137, 143, 202, 234, 278, 310, 351

<223> n = A,T,C or G 



<400> 208 

agaggagatn tttgccatgc ctgaatnctt 
cttagtctgc tttgntaaaa gcaagtatta 
tagctaacta ataaagnttg atntaggcat 
ctctcatgtt tgatgtattt tncaaactaa 
attaaatcat ttatttcctt tactttctca 
gggaacccan ttttggaaaa ccagattcat 
tttttgaaaa gatgtggacc t 

<210> 209 
<211> 231 
<212> DNA 

<213> Homo sapiens 



tcctatncca ccctancact taacatatta 60 
ccttnaactt gnctcttact ctttgccctt 120 
tattatataa ttctgagtca ttcatggtat 180 
gatctatgat agtttttttt ccanagttcc 240 
cctctgtnga aacatttaga aactggattt 300
agtcatgaaa atggaaactt ncatattctg 360 

381 



<220> 

<221> misc_feature 
<222> 83 

<223> n = A,T,C or G



<400> 209 

gtggagagca agtgatttat taaagcaaga 
tcagggtgtc attgaaagac agnggaaacc 
catttcttca atattttcac caggacttcc 
tgatccgtgc atctcttcac tcctaacttg 

<210> 210 
<211> 371 
<212> DNA 

<213> Homo sapiens 



cgttgaaacc tttacattct gcagtgaaga 60 
aggatgaaag tttttacatg tcacacacta 120 
gcaatgaggc ttcgtttctg aagggacatc 180 
gctgcaacag cttccacctg c 231 



<400> 210 

tccatcctgg ttttgcagag atcaggttgt tgacagttcc tggttgaccc acagctaccc 60
atgtcagtta tctccactaa catatccaag aatctttgta ggacaatttc tccacctgca 120
aggtttttta ggtagaactc ttcttttaag gcaattagcc cattgccaaa aggttttact 180
gtcttaaagc tgtctttctg agatctaatt ccaaggactt ctccacagct aagtgagatg 240
cctcacacca ttaggtgatg ctttggacag aacagagtat tttcatcttg tgtttaaagc 300
aattccttgg cttcggctcc tcaccacttt ctatgccagt ctcccattta tgtccctagt 360
aatgcctatg c 371



<210> 211 
<211> 471 
<212> DNA 

<213> Homo sapiens 



<400> 211 

tttattttaa aagaaaaaaa ttaaaataga gccaacaaat gcaattaaga aaaaaaaagt 60
attgagacac aaggggacct acatgttctg gtctaagaag catgcaagta ttacaaagca 120
ttccagatac agtatgacag aggaacagtg aacaagcatt ggaacgatgc tctttctttc 180
agaaacggga agtctaacag ttatgttttc acaatggtag tgattaaacc atctttattt 240
ttaaggaatt ttataggaag aattttagca ccatcattaa aggaaaaata ataatacctt 300
tttagccctg cctatctcca gtcttggaat aataacagaa gcatagcacc tttcagtatc 360
taaaatataa acaagaatag taagtccatc ccagcttcta gagatgaggt agctcatgct 420
aagaaatgtt gggtcatttt tcctatgaaa gttcaaaggc caaatggtca c 471



<210> 212 
<211> 401
<212> DNA 
<213> Homo sapiens 



<400> 212 

tggcctgtct ccttcacata gtccatatca ccacaaatca cacaacaaaa gggagaggat 60
atattttggg ttcaaaaaaa gtaaaaagat aatgtagctg catttctttg gttattttgg 120
gccccaaata tttcctcatc tttttgttgt tgtcatggat ggtggtgaca tggacttgtt 180
tatagaggac aggtcagctc tctggctcgg tgatctacat tctgaagttg tctgaaaatg 240
tcttcatgat taaattcagc ctaaacgttt tgccgggaac actgcagaga caatgctgtg 300
agtttccaac ctcagcccat ctgcgggcag agaaggtcta gtttgtccat caccattatg 360
atatcaggac tggttacttg gttaaggagg ggtctacctc g 401



<210> 213 
<211> 461 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature

<222> 239, 290, 358, 359, 391, 393 
<223> n = A,T,C or G



<400> 213 

tgtgaagcat acataaataa atgaagtaag 
ttccctaaga cctgaaaatg aacatagtat 
tttcctcaca caatttggaa tcatataata 
acggatagaa tgcatcaagt gtttattatg 
aaaaggtgtt tgcccattct aagaaatgag 
tcctgttagg tggagtgtat gtgttgacat 
ccccattatt tgaataaagt gactgctgaa 
aatgtttaaa gaaaaaccta taatggaaag 

<210> 214 

<211> 181 

<212> DNA 

<213> Homo sapiens 



ccatactgat ttaatttatt ggatgttatt 60 
gctagttatt tttcagtgtt agccttttac 120 
taggtacttt gtccctgatt aaataatgtg 180 
aaaagagtgg aaaagtatat agcttttanc 240 
cgaatatata gaaatagtgn gggcatttct 300 
ttctccccat ctcttcccac tctgtttnnt 360 
nangactttg aatccttatc cacttaattt 420 
tgagactcct t 461 



<400> 214 

cctgagcttc tactcctttc ccttaagatt 
cagctcccca gacccacacc aagaacccca 
cagataagtc ctaaggagaa tgccgaagcg 



cctccaaagc accagctcca taaaatcctt 60 
catgttaatt ggatcagcca aatctacaag 120 
tttttcttct tcctcaagcc tagcatgaga 180 

181 



<210> 215 
<211> 581 
<212> DNA 
<213> Homo sapiens 



<400> 215 

ctgctttaag aatggttttc caccttttcc ccctaatctc taccaatcag acacatttta 60
ttatttaaat ctgcacctct ctctatttta tttgccaggg gcacgatgtg acatatctgc 120
agtcccagca cagtgggaca aaaagaattt agaccccaaa agtgtcctcg gcatggatct 180
tgaacagaac cagtatctgt catggaactg aacattcatc gatggtctcc atgtattcat 240
ttattcactt gttcattcaa gtatttattg aatacctgcc tcaagctaga gagaaaagag 300
agtgcgcttt ggaaatttat tccagttttc agcctacagc agattatcag ctcggtgact 360
tttctttctg ccaccattta ggtgatggtg tttgattcag agatggctga atttctattc 420
ttagcttatt gtgactgttt cagatctagt ttgggaacag attagaggcc attgtcctct 480
gtcctgatca ggtggcctgg ctgtttcttt ggatccctct gtcccagagc cacccagaac 540
cctgactctt gagaatcaag aaaacaccca gaaaggacct c 581

<210> 216 
<211> 281 
<212> DNA
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 37, 38, 164, 176, 254 

<223> n = A,T,C or G 

<400> 216 

ccgatgtcct gcttctgtgg accaggggct cctctgnngg tggcctcaac cacggctgag 60 
atccctagaa gtccaggagc tgtggggaag agaagcactt agggccagcc agccgggcac 120 
ccccacttgc gccccgaccc acgctcacgc accagacctg cccnggcggt cgctcnaaag 180 
ggcgaattct gcagatatcc atcacactgg cggacgctcg agcatgcatc tagagggccc 240 
aattcaccct atantgagtc gtattacaat tcactggccg t 281 

<210> 217 
<211> 356 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature

<222> 33, 322 

<223> n = A,T,C or G 



<400> 217 

atagcaggtt tcaacaattg tcttgtagtt tgnagtaaaa agacataaga aagagaaggt 60
gtggtttgca gcaatccgta gttggtttct caccataccc tgcagttctg tgagccaaag 120
gtcttgcaga aagttaaaat aaatcacaaa gactgctgtc atatattaat tgcataaaca 180
cctcaacatt gctcagagtt tcatccgttt ggttaagaaa acattccttc aattcatcta 240
tggcatttgt agtggcattg tcgtctatga actcttgaag aagttctttg tattcagtct 300
tagacacttg tggattgatt gncttggaaa tcacattctc caataaggga cctcgg 356



<210> 218 
<211> 321 
<212> DNA 
<213> Homo sapiens 



<400> 218 

ttgtccatcg ggagaaaggt gtttgtcagt tgtttcataa accagattga ggaggacaaa 60
ctgctctgcc aatttctgga tttctttatt ttcagcaaac actttcttta aagcttgact 120
gtgtgggcac tcatccaagt gatgaataat catcaagggt ttgttgcttg tcttggattt 180
atatagagct tcttcatatg tctgagtcca gatgagttgg tcaccccaac ctctggagag 240
ggtctggggc agtttgggtc gagagtcctt tgtgtccttt ttggctccag gtttgactgt 300
ggtatctctg gacctgcctg g 321



<210> 219 
<211> 271 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature
<222> 41

<223> n = A,T,C or G 



<400> 219 

ccggttaggt ccacgcgggg gcagtggagg 
accctatggc ttacaaagta gagttggccc 
actcctaaca gtcttccttg ccctgccatc 
gggcatgctt tctaaacaca gccacaggag 
agtcttagat aagtaaggtg acttgtctaa 

<210> 220 
<211> 351 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 32, 43 

<223> n = A,T,C or G 



cacaggctca nggtggccgg gctacctggc 60 
agtttccttc cacctgaggg gagcactctg 120 
atctggggtg gctggctgtc aagaaaggcc 180 
gcttgtaggg catcttccag gtggggaaac 240 
g 271 



<400> 220 

gtcctacgac gaggaccagc ttttcttctt 
cctgcccgaa tttgctgact gggctcagga 
caaagagttc tgcgagtgga tgatccagca 
ggtgtccaga gggtttccta tcgctgaagt 
gcccaacact ttggtctgtt ttgtcagtaa 
gtagcatcat tccgtccctg tggaaggatt 

<210> 221 
<211> 371 
<212> DNA 
<213> Homo sapiens 



cnacttttcc canaacactc gggtgcctcg 60 
acagggagat gctcctgcca ttttatttga 120 
aatagggcca aaacttgatg ggaaaatccc 180 
gttcacgctg aagcccctgg agtttggcaa 240 
tctcttccca cccatgctga cagtgaactg 300 
tgggcctact tttgtctcag a 351 



<400> 221 

gtctgcagaa gcgtgtctga ggtgtccggt ggaggtggca gccgagctct gggactaatc 60
accgtgctgg ggacggcacc gcgtcaggat gcaggcagat ccctgcagaa gtgtctaaaa 120
ttcacactcc tcttctggag ggacgtcgat ggtattagga tagaagcacc aggggacccc 180
acgaacggtg tcgtcgaaac agcagccctt atttgcacac tgggagggcg tgacaccagg 240
aaaaccacaa ttctgtcttt cacggggggc cactgtacac gtctctgtct gggcctcggc 300
cagggtgccg agggccagca tggacaccag gaccagggcg cagatcacct tgttctccat 360
ggtggacctc g 371



<210> 222 
<211> 471 
<212> DNA 
<213> Homo sapiens 



<400> 222 

gtccatgttc catcattaat gttccaacat caccagggac acaaagctgc aaaaatgaga 60
agggaaataa ggttagagaa aggatccggg caatcttaag gactgaggaa gacatgttcc 120
ccaacccttg aactcacaaa ccctgaagct caaggattgc atccttcctc caaatctcac 180
tcaacataat aagtgcagaa caacatgcca aagcactgta tgaagcacta gggacaaaga 240
caaggtcaaa atccttgtaa ccaaatttaa tggtattgta atgcagtgtt aacacaggac 300
agtaacagaa cacccaagaa ccaaacagaa gagggtaggg ataagcataa atgaagtaac 360
atgaaataaa cttccaaatg gaaaacttgt ccataccccc agggcaagtc aactacagtc 420
tcccaaagga cataaattcc acttagggca cactagacag aaaacaatat t 471



<210> 223 
<211> 411
<212> DNA 

<213> Homo sapiens 



<400> 223 

agttgctcta caatgacaca caaatcccgt 
atttgaagta atgttttagt aaggagagat 
agctaccgat taactaatcg gaacatgtaa 
ttgtcctaca atgaaagccc tcatgtgcag 
tccttgcaaa tgggtgtgac gcggttccag 
aaaggttagc agagcaaagt gcggtgcttt 
gaggctcctg cctgagcttc cttccccagc 

<210> 224 
<211> 321 
<212> DNA

<213> Homo sapiens 



taaataaatt ataaacaagg gtcaattcaa 60 
tagaagacaa caggcatagc aaatgacata 120 
aacagttaca aaaataaacg aactctcctc 180 
tagagatgca gtttcatcaa agaacaaaca 240
atgtggattt ggcaaaacct catttaagta 300 
agctgctgct tgtgccgctg tggcgtcggg 360 
tttgctgcct gagaggaacc a 411 



<220> 

<221> misc_feature 
<222> 31 

<223> n = A,T,C or G



<400> 224 

ggtctgaagt ttgataacaa agaaatatat 
aaaaacacaa ctatctgttg acataacata 
cttaatctga ttgtccaaat cattaaaata 
tcgtttggct ggatcataga ttaacatttt 
gtttttgaca tgggatgcta ggcttcctgg 
ctgtaaagat tccacttctg g 

<210> 225 
<211> 251 
<212> DNA 

<213> Homo sapiens 



ntaagacaaa aatagacaag agttaacaat 60 
tggaaacttt ttgtcagaaa gctacatctt 120 
tggatgattc agtgccattt tgccagaaat 180 
cgagagcaaa tccaagccat tttcatccaa 240 
tttccatttg ggaaatgtat tcttatagtc 300 

321 



<220> 

<221> misc_feature
<222> 34 

<223> n = A,T,C or G



<400> 225 

atgtctgggg aaagagttca ttggcaaaag 
gagaggacat gtcactgaat ggggaaaggg 
ccagtaggca ggaagatggc tttgggcagt 
gctccggaac gaggtcatct tctacaggtt 
gatcattctc t 

<210> 226 
<211> 331 
<212> DNA 

<213> Homo sapiens 



tgtnctccca agaatggttt acaccaagca 60 
aacccccgta tccacagtca ctgtaagcat 120 
ggctggatga aagcagattt gagataccca 180 
cttccttcac tgagacaatg aattcagggt 240 

251 



<220> 

<221> misc_feature

<222> 26, 34, 35, 36, 37, 39 

<223> n = A,T,C or G 



WO 02/083956 PCT/US02/12378



<400> 226 

gttaggtccc aggccccccg ccaagnggtt 

aggcatggca ttaaaacgtt gcaaattcct 

tgtagggtgc agtctttact ccctaacccg 

gacagatgag agagggcagg acttcaggct 

cctggagcac gggaggggag gtgacggctg 

gaggggacta ggaagggcta ttccaggctc 

<210> 227 
<211> 391 
<212> DNA 

<213> Homo sapiens 



accnnnntna ccactcctga cccaaaaatc 60 

ttactgttat cccccccacc accaggacca 120 

tttcccgaaa aaggtgctac ctcctttcca 180 

ggatccacca ctgggctctc cctcccccag 240 

gtgactgatg gatgggtagt gggctgagaa 300 

a 331 



<400> 227 

aggtctgccc ttgaagtata ggaaggaatc 
gctgaagcta gaagtgcaac cccctcctga 
cagcccgcag gaatgttcat atctgagcaa 
ttcaagattg gctccttaaa ccacccacaa 
aagagaacaa agccctcgct gtgatcagga 
ttccaggtat gagacagagt tgaacctgcg 
gcgcagggtg ctggatgagc tgaccctgga 

<210> 228 
<211> 391 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 35 

<223> n = A,T,C or G



atagttggag gacttctgca ttatttgttg 60 
tttctgcagc aagatgaact gccttatccc 120 
tcaatgggca ctgtgttcaa ccacgccatt 180 
ggcaccagct ctgggagaag ctgcagggag 240 
tgggtgtctc ataccttttc tctggggtca 300 
catgagcgtg gaggccgaca tcaacggcct 360 
c 391 



<400> 228 

gttgtccata gccacctcct gggatagaag 
cttaggacat aggtccagcc ctacagatta 
ggcttagtct ccaccctcag gcatggaacc 
ggagactcag gctgatataa aaataacaaa 
gcttgtctga atagatttga gcaacagtct 
tcctgaatat tcttctggac atcattgctg 
gctgacatct gtcaggtttg gaagtctcat 

<210> 229 
<211> 341 
<212> DNA 

<213> Homo sapiens
<220> 

<221> misc_feature 
<222> 202 

<223> n = A,T,C or G 



ctttntagtt catagttcga ttagtgtgtc 60 
gctgggtgaa gaaggcaagt gtctcgacag 120
attcagggtg aagcctggga tgtgggcaca 180 
atcagtaata aaaaaattat aaaacctgtt 240 
tgcttttgtt aaaatcctgg agccgttaag 300 
gctggagaaa ggagccccag gcccggctcg 360 
c 391 



<400> 229 

gtccatggct tctcacccag acagtctttc tgggcaactt ggggaagccc ctgttctgct 60
caagtctcac cccatggaag aggtggggga agggggcctt ggtttttcag gaagacgggt 120
tggagagcac gagtcactac aaagcagtaa aagtgaatgg tgtctccagg ggctgggtcc 180
agaacaccgc ggagagcccc anccataaag gtgtgttccg cctctggcct gcaggaatct 240
ctttgaatct ctttgattgg tggctccaag agcaatggga agtcaacagc caggaggctg 300
gactgggttc cctgggaccc cgaggtccca gaggctgctg g 341
<210> 230
<211> 511 
<212> DNA 

<213> Homo sapiens 



<400> 230 

gtccaagcca aggaaaccat tcccttacag 
gggctaaagg aaatggacaa tgcaggacag 
cttcagttgt ctgaagaatg gttttatgcc 
tatagttcac aatagagctc agggagcccc 
tccttcatgc ccaagcctga gctcagatcc 
atgccctact tggaaagatc taagatctga 
atggtgttga atgcaagttt aattaccatg 
aagttcagtt ttagaaaagg gagtctgttc 
aaaggagaca actaactaaa gtagtgagat 



gagacctccc tgtacacaca ggaccgcctg 60 
ctagtgtttc tggctacaga aggggaccat 120 
cacatcatac cattccttgg atgaaacccg 180 
taactcttcc aaaccacatg ggagacagtt 240 
agcttgcaac taatccttct atcatctaac 300 
atcttatcct ttgccatctt ctgttaccat 360 
gagattgttt tacaaacttt tgatgtggtc 420 
cagatcagtg ccagaactgt gcccaggccc 480 



<210> 231 
<211> 311 
<212> DNA

<213> Homo sapiens 



<400> 231 

ggtccaagta agctgtgggc aggcaagccc 

cccctcgtgt cagctcaggc agctcgaggc 

agttagcgcc ccaccgccgt ggagttcgta 

gctccctgga gccctgttgg cagctctagc 

gtttttctcg cctcactttc caccaagtgt 
ggggtggacc t

<210> 232 
<211> 351 
<212> DNA 

<213> Homo sapiens 



ttcggtcacc tgttggctac acagacccct 60 
ccccgaccaa cacttgcagg ggtccctgct 120 
ccgcttcctt agaacttcta cagaagccaa 180 
tttgcagtcg tgtaattggc ccaagtcatt 240 
ctagagtcat gtgagcctcg tgtcatctcc 300 

311 



<400> 232 

tcgtttagct aataatccct tccttgatga tacactccaa cttcttgttt ttctttattt 60
ctaaaaagcg gttctgtaac tctcaatcca gagatgttaa aaatgtttct aggcacggta 120
ttagtaaatc aagtaaattt catgtcctct taaaggacaa acttccagag atttgaatat 180
aaatttttat atgtgttatt gattgtcgtg taacaaatgg cccccacaaa ttagtagctt 240
aaaatagcat ttatgatgtc actgttttct ttgccttttc attaatgttc tgtacagacc 300
tatgtaaaca acttttgtat atgcatatag gatagctttt ttgagggtat a 351



<210> 233 
<211> 511 
<212> DNA 

<213> Homo sapiens 



<400> 233 

aggtctggat gtaaggatgg atgctctcta tacatgctgg gttggggatg ctgggactgc 60
acagccaccc ccagtatgcc gctccaggac tctgggacta gggcgccaaa gtgtgcaaat 120
gaaaatacag gatacccagg gaactttgaa tttcagattg tgaaaagaaa acaaatcttg 180
agactccaca atcaccaagc taaaggaaaa agtcaagctg ggaactgctt agggcaaagc 240
tgcctcccat tctattcaca gtcatccccc tgaggctcac ctgcatagct gattgcttcc 300
tttcccctat cgcttctgta aaaatgcaga ctcactgagc cagactaaat tgtgtgttca 360
gtggaaggct gatcaagaac tcaaaagaat gcaacctttt gtctcttatc tactacaacc 420
aggaagcccc cacttaaggg ttgtcccacc ttactggact gaaccaaggt acatcttaca 480
cctactgatt gatgtctcat gtccccctaa a 511
<210> 234

<211> 221 

<212> DNA 

<213> Homo sapiens 



<400> 234 

caggtccagc gaaggggctt cataggctac 

ctctccatca gcatagcctc cgatgaccat 

ccggctgtac atggccctgg tcagccatga 

atctcccaga agctcctcat caatcaccat

<210> 235 
<211> 381 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 33 

<223> n = A,T,C or G 



accaagcatg tccacataac cgaggaagct 60 
ggtgttccac aaagggttca tcttcgagcg 120 
atgaatagct ctaggactat agctgtgtcc 180 
ctggccgaga c 221 



<400> 235 

ggtccaagaa agggacatct atgtgaaagt ganactgaga cagtgctggt cacaggtcat 60
gctgcagaat aatacattcc caggcactgt cacgtggggg acccaagagg ccccaggagt 120
gacctataac ctctccagaa agaccactct gtgtggcatc acagtccaca cagtttaagg 180
aaatatttag acttaacaat cagacaccag ctcttactca cacttacact cacagcccac 240
acacaagtgt gcaaacatac acacacatat atatttcctg atacattcat ggaatatcag 300
agccctgccc tgaagtcgtt agtgtctctg ctccccaaac cgctgctccc acattggcta 360
agctccctca agagacctca g 381



<210> 236 

<211> 441 

<212> DNA 

<213> Homo sapiens 



<400> 236 

aggtcctgtt gcccctttct tttgcccaac ttcgccattt gggaattgga atatttaccc 60
aacacctgta ctgcattgaa tattggaagc aaataacttg gctttgatct tataggctca 120
cagatggagg aacgtacctt gaagttcaga tgagatttcg gacttttgag ttgatgctga 180
aacagcttga gatttttggg gactactgag agatgataat tgtattgtgc aatatgagaa 240
ggacatgaga tttggtgggc ataggtgtga aatgacattg tttggatgtg tttaccctcc 300
aaatctcttg ttgaatgtga tcttaaacgt tggtggtggg cctagtggaa ggtgttgaat 360
catgggggtg gactcttcat aatttgctta gctccatccc cttggtgatg agcaagtcct 420
tgctctgttg tgtcacatga g 441



<210> 237 
<211> 281 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 81, 90, 194, 209, 210, 211, 219, 233 

<223> n = A,T,C or G 



<400> 237 

tcctaaaaaa ttagctgacc ttgttaaaaa tgttggcgtg agcagtatat tattacctat 60 
ctttttttat tgtgtgtgtg ngtgtgtgtn

ttccctcttt acatttttct tgtttctttc 

ctgtaaagtg aatnttttaa tgaaaacann 

atcagatgta attgagaggc caacaggtaa 

<210> 238 
<211> 141 
<212> DNA 

<213> Homo sapiens 



ttaaactaat tggctgaaat atctgcctgt 120 
cttatttatc tttgtccatc ttgagatcta 180 
nccaagttnt actctcactg ggnttgggac 240 
gtcttcatgt c 281 



<220> 

<221> misc_feature

<222> 30, 85 

<223> n = A,T,C or G 



<400> 238 

gtctgcctcc tcctactgtt tccctctatn aaaaagcctc cttggcgcag gttccctgag 60 
ctgtgggatt ctgcactggt gcttnggatt ccctgatatg ttccttcaaa tccactgaga 120 
attaaataaa catcgctaaa g 141 

<210> 239 
<211> 501 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 29, 30, 65, 86, 471, 489 

<223> n = A,T,C or G 



<400> 239 

aacaatctaa acaaatccct cggttctann 
ctgangcttt attcccccac tatgcntatc 
cctaaactat actaaagccc ttttcccatg 
cttgttctag aagtcttaat atgggctgtt 
ttctagctgc ctttattcac atagtgatgg 
gtcgctgtca ttctgtcatt gctgctactc 
atcccctgta tcattccaag aggagcattc 
cttattagaa aacaaactgc ttgacccagg 
tttgctcana tccctgatcc t 

<210> 240 
<211> 451 
<212> DNA 

<213> Homo sapiens 



atacaatgga ttccccatat tggaaggact 60 
ttatcatttt attattatac acacatccat 120 
catggatgga aatggaagat ttttttttaa 180 
gccatgaagg cttgcagaat tgagtccatt 240 
ggtactaaaa gtactgggtt gactcagaga 300 
taacactgag caacactctc ccagtggcag 360 
atccctttgc tctaatgatc aggaatgatg 420 
aacaagtggc ttagcttaag naaacttggc 480 

501 



<400> 240 

tgtcctgaaa ggccattact aatagaaaca cagcctttcc aatcctctgg aacatattct 60
gtctgggttt ttaatgtctg tggaaaaaaa ctaaacaagt ctctgtctca gttaagagaa 120
atctattggt ctgaaggttt ctgaacctct ttctggttct cagcagaagt aactgaagta 180
gatcaggaag gggctgcctc aggaaaattc ctagatccta ggaattcagt gagaccctgg 240
gaaggaccag catgctaatc agtgtcagtg aatccacagt ctttacttcc tgcctcataa 300
agggccaggt ctccccagta ccaagtcctt tcctcatgaa gttgtgttgc ctcaggctgt 360
ttagggacca ttgcctgtct tggtcacatg agtctgtctc cttactttag tccctgggca 420
atccttgctt aatgcttttg ttgactcaac g 451



<210> 241 
<211> 411 
<212> DNA

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 62, 82, 364, 370, 385 

<223> n = A,T,C or G 



<400> 241 

aatctccagt gtgatggtat cggggttaga gcttcaatct ccagtgtgat ggtactgcag 60
cnagagcttc aatctccagt gngatggtat tagggttaga tcttcaatct ccagtgtgat 120
ggtatcaggg ttagagcttc agcctccagt gtgatggtat cagggttaga gcttcagcct 180
ccagtgtgat ggtatcgggg ttagatcttc aatccccagt ggtggtggtt agagcttcaa 240
tctccagtgt gatggtattg gggttagagc ttcaatctcc agtctgatgg tgtttcggga 300
tggggctttt aagatgtaat tagggtttaa gatcataagg gacctggtct gatggggatt 360
agtncgcttn tatgaagaga cacangaggg cttgctctat ctctgactct c 411



<400> 242 

ttccccttca caacagtaga gacctacaca gtgaactttg gggacttctg agatcagcgt 60
cctaccaaga ccccagccca actcaagcta cagcagcagc acttcccaag cctgctgacc 120
acagtcacat cacccatcag cacatggaag gcccctggta tggacactga aaggaagggc 180
tggtcctgcc cctttgaggg ggtgcaaaca tgactgggac ctaagagcca gaggctgtgt 240
agaggctcct gctccacctg ccagtctcgt aagaaatggg gttgctgcag tgttggagta 300
ggggcagagg gagggagcca aggtcactcc aataaaacaa gctcatggca c 351



<400> 243 

gtctgtgctt tatcaggaaa agcacaagaa tatgtttttc tacctaaaac cctcttctac 60
tttaaaaatg gtttgctgaa tttttctatg tttttaaaat gtttttatgc ttttttttaa 120
acacgtaaag gatggaacct aatcctctcc cgagacgcct cctttgtgtt aatgcctatt 180
cttacaacag agaaacaagt acattaatat aaaaacgagt tgattattgg ggtataaaat 240
a 241



<210> 244 
<211> 301 
<212> DNA 
<213> Homo sapiens 



<400> 244 

ggtccagagc aatagcgtct gtggtgaagc gcctgcactc ctcgggagac atgcctggct 60
tatatgctgc atccacataa ccatagataa aggtgctgcc ggagccacca atggcaaaag 120
gctgtcgagt cagcattcct cccagggttc catatacctg acctccttca cgttggtccc 180
agccagctac catgagatgt gcagacaagt cctctcgata tttatagctg atatttctca 240
ccacatttgc agcagccaaa acaagtggag gttcctccag ttctatccca tggagctcca 300
g 301



<210> 245 
<211> 391 
<212> DNA 
<213> Homo sapiens



<400> 245 

ctgacactgc tgatgtgggc cggggggcgc cgaggcacaa ctggtggccg gaccattgag 60
gcacctggag ggtaggcagc ttgtggtgca gacaccacag agagagaaaa gttggatgga 120
gtggtgggaa taatcagggt ggcacactgt gcctagaagc ttccagggcc accaagagaa 180
tgggaaggga aactacaaca ttcacaacag aaataggagt caattcactt agacccagaa 240
ctccagaaag ggggagtgta ggaatctaca atttcaaagc cagctcgtgt ctacctagag 300
ccccaaactg cataagcacc aggattgtac accttagtcc ctcaagatag tttcaagtga 360
gcgtgcaatt cactcttaca gaggagggcc t 391



<210> 246
<211> 291 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature

<222> 26, 80, 82, 185, 255, 259 

<223> n = A,T,C or G 



<400> 246 

tcctccacag gggaagcagg aagttngacc agcttcaggc tggaacgtgc ccagggcaca 60
gagctggcaa ggtgcaaagn cntctgcaga atattcacca ggttgacaca gacctccaca 120
ttcagacata ttccaagctt ctggggtctt cagggcccca gaatttcctg gtcttgggca 180
tggtncacaa gtcatttgtc cttcctcatt ttggaaggtt ccatttggac ataaaatgca 240
agcgttctcg tgctncatna taataggtcc cagcctgcac tgacacattt g 291



<210> 247 
<211> 471 
<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 80, 110, 125, 245, 249, 279, 318, 336, 339, 455, 471 
<223> n = A,T,C or G 



<400> 247 

cactgagtga atgagtatat aatttatgaa aacagaaaag tgctttggaa aaaaaaaaag 60
acaacaggag tacatacagn gaaccaaaaa gagtgtacca ggaggagcan accctgaaca 120
gttanaacta tggaaatcgc tatgctttgt gttgtcacag gagttaaaat aggaataccc 180
tgcatacaat aaatatttat tggataaata actaagcctg ataccctttt caatgcgtta 240
tacanactnt atcatcacac cactaatcta agttctcana agttaaacat tacaagactt 300
cagaacaaca taggcgtntt tggctccatt taacanaana aggaccatag tgatcattta 360
atctctatga gtctgtctta tcttctggaa aaggggccta acaccatttc cttttgcaaa 420
aaggtagctg ccttgcttcc agttctacca tcctntagca acccatcttt n 471



<210> 248 
<211> 551 
<212> DNA 
<213> Homo sapiens 

<400> 248 

ccatgggatc aggaatgggg tcaggtcagt tgacctgagc atacccatta aacatgttca 60 

aatgtcccca tcccacccac tcacatgaca tggctcccga gccctgagat ctgtatccca 120 

agaacctcag ttgagaaata tttatggcag cttcactgtt gctcaagagc ctgggtattg 180 

tagcagcctg ggggcaggtt gtccctaatg ttctccaagt tcttcacatc agccagaatc 240
ccatctatgc ttgtctccag caaatggagg
agctctgaca tcatgggccg cagttggctg 
tgctccagta agaccagccc ctcttcatct 
aggccgtcta atgtgttgag tgtgtcttgg 
tcaaccttct gggcttctgt aatcaccatc 
agctccttcc t 

<210> 249 
<211> 181 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 3, 96 

<223> n = A,T,C or G 



tggcccctct gctgacgtgc cctctcttcc 300 
ttgatctggg tcttggctcg ggaaagcttc 360 
acactgagag gctggtccat cagatgcagg 420 
attgtaaccc cagcgttctt ggctctggta 480 
tgtactgcat ccatattcgt gtcgaactcc 540 

551 



<400> 249 

atntccagag ggaccgtaag actggtacaa 
cacaatgtct tcacctccac aggggctcat 
cagagctttg tttaggatca tcctcttccc 

g 



gtttacacca taagaggcga cgtggtcagc 60 
cacggnggtc agggcaaggg cccccagcat 120 
aaggcagcct tagcagttgc tgacctgccc 180 

181 



<210> 250 
<211> 551 
<212> DNA 
<213> Homo sapiens 



<400> 250

tctgtagcta ggatgagctg gctctcaagc aaaagtttgt cttcctgggt ccatttgtgg 60
ttatcacttg ttattgaatg tacatcacaa attaaagtct gcattgttgg acgtaagaga 120
atgtgccgac tttggtaacc aggagatttc atgttactgg actgcctgta gtcacgtatt 180
tctgctatga cacatccgca atgaaaaata ttaacctgag atttttctag gagatcaacc 240
aaaataggag gtaattcttc tgcatccaaa tattcaagca actctccttc ttcatagggc 300
agtcgaatgg tctcggaatc tgatccgttt tttcccctga gcatcagaga atatccctca 360
tttcctgggt atagattgac cactaaacat gacaaagtct cttgcataac aagcttctct 420
aacaagttca catttcttct taatttctta acttcaggtt ctttttcaca ttcttcaata 480
tacaagtcat aaagtttttg aaatacagat tttcttccac ttgataggta tttcctttta 540
ggaggtctct g 551

<210> 251
<211> 441
<212> DNA
<213> Homo sapiens

<400> 251

tgtctgctct cccatcctgg ttactatgag tcgctcttgg cagaaaggac cacagatgga 60
gagcttggca ctcgctccaa ctttgccgaa aagaggacaa ccaccaaagt agtaggtaaa 120
aacacaattt tagcagcagt gaaataaaaa gaggaagtga ggatggggcc aggccgcaac 180
tataattaaa ctgtctgttt aggagaagct gaatccagaa gaaacacaag ctgtaaagtg 240
agagaggaca gggagcaggg cctttggaga gcaggagagg acaggctgtc accaagcgct 300
gctcggactc tgccctgaaa gatttgaatt ggacactgtc cagtcacgtg tgtggcaaac 360
cgtactccaa gcacttttct cacggcagag gaaggagctg ccatggctgt acccctgaac 420
gtttgtgggg ccagcgatgt g 441



<210> 252 
<211> 406 
<212> DNA 
<213> Homo sapiens



<400> 252 

tttttttttg aacaagtaaa aatttcttta tttgctgaca ataagataac ctacagggaa 60 

aacctgatga aatctattaa aaagttacta aaactaataa aagaatttag gaaggttata 120 

gaatgtaaga ccaagacaca aaaatcaatt acatttctat ataatagcaa tgaacagata 180 

ctgaaatttt aaaaactaaa tcattttaca aaagtatcac aatatgaaac actccgggat 240 

aaattggata aaagatgtgc aagactgtac aaaagctaca aaacatttat gaaggaaatt 300 

ggaagataga aacaagatag aaaatgaaaa tattgtcaag agtttcagat agaaaatgaa 360 

aaacaagcta agacaagtat tggagaagta tagaagatag aaaaat 406 

<210> 253 
<211> 544 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 224 

<223> n = A,T,C or G 



<400> 253

gaaggagttc agtagcaaag tcacacctgt ccaattccct gagctttgct cactcagcta 60
atgggatggc aaaggtggtg gtgctttcat cttcaggcag aagcctctgc ccatccccct 120
caagggctgc aggcccagtt ctcatgctgc ccttgggtgg gcatctgtta acagaggaga 180
acgtctgggt ggcggcagca gctttgctct gagtgcctac aaanctaatg cttggtgcta 240
gaaacatcat cattattaaa cttcagaaaa gcagcagcca tgttcagtca ggctcatgct 300
gcctcactgc ttaagtgcct gcaggagccg cctgccaagc tccccttcct acacctggca 360
cactggggtc tgcacaaggc tttgtcaacc aaagacagct tccccctttt gattgcctgt 420
agactttgga gccaagaaac actctgtgtg actctacaca cacttcaggt ggtttgtgct 480
tcaaagtcat tgatgcaact tgaaaggaaa cagtttaatg gtggaaatga actaccattt 540
ataa 544



<210> 254 
<211> 339 
<212> DNA 
<213> Homo sapiens 



<400> 254

tggcattcag ggcagtgtct tctgcatctc ctaggaacct cgggagcggc agctccggcg 60
cctggtagcg agaggcgggt tccggagatc ccggcctcac ttcgtcccac tgtggttagg 120
ggtgagtcct gcaaatgtta agtgatttgc tcaaggtgcc catttcgcag gaattggagc 180
ccaggccagt tctctgagcc tatcattagg gctaaaggag tgcgtgatca gaatggtgtc 240
tggacggttc tacttgtcct gcctgctgct ggggtccctg ggctctatgt gcatcctctt 300
cactatctac tggatgcagt actggcgtgg tggctttgc 339



<210> 255 
<211> 405 
<212> DNA 
<213> Homo sapiens 



<220>
<221> misc_feature
<222> 11, 39, 70, 87, 103, 120, 177, 181, 220, 229, 233, 341, 345,
366, 380, 402
<223> n = A,T,C or G

<400> 255

gaggtttttt nttttttttt tttttttttt caattaaana tttgatttat tcaagtatgt 60
gaaaacattn tacaatggaa acttttntta aatgctgcat gtnctgtgct atggaccacn 120
cacatacagc catgctgttt caaaaaactt gaaatgccat tgatagttta aaaactntac 180
ncccgatgga aaatcgagga aaacaattta atgtttcatn tgaatccana ggngcatcaa 240
attaaatgac agctccactt ggcaaataat agctgttact tgatggtatc caaaaaaaaa 300
tggttgggga tggataaatt caaaaatgct tccccaaagg ngggnggttt ttaaaaagtt 360
tcaggncaca acccttgcan aaaacactga tgcccaacac antga 405

<210> 256
<211> 209
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 6
<223> n = A,T,C or G

<400> 256

gggcangtct ggtcctctcc ccacatgtca cactctcctc agcctctccc ccaaccctgc 60
tctccctcct cccctgccct agcccaggga cagagtctag gaggagcctg gggcagagct 120
ggaggcagga agagagcact ggacagacag ctatggtttg gattggggaa gaggttagga 180
agtaggttct taaagaccct tttttagta 209

<210> 257
<211> 343
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 306, 311, 343
<223> n = A,T,C or G

<400> 257

tctggacacc ataatccctt ttaagtggct ggatggtcac acctctccca ttgacaagct 60
gggttaagtc aataggttga ctaggatcaa cacgacccaa atcaataaga tactgcagtc 120
tattgagact caaaggctta tactggcgtc tgaaactatg tccttcgtta aacccgtatt 180
ttgggattcg gatgtaaaat ggagtctggc ctccctcaaa gcccaagcgg ggccgggttc 240
ctctttgcct ttctccttta tggcctctgc cacattttct acctcttctc cgacctcttg 300
gtcttntctc nggtttcttg gagccgggat tcggctttaa gtn 343



<210> 258 
<211> 519 
<212> DNA 

<213> Homo sapiens 



<400> 258

gcggcttctg acttctagaa gactaaggct ggtctgtgtt tgcttgtttg cccacctttg 60
gctgataccc agagaacctg ggcacttgct gcctgatgcc cacccctgcc agtcattcct 120
ccattcaccc agcgggaggt gggatgtgag acagcccaca ttggaaaatc cagaaaaccg 180
ggaacaggga tttgcccttc acaattctac tccccagatc ctctcccctg gacacaggag 240
acccacaggg caggacccta agatctgggg aaaggaggtc ctgagaacct tgaggtaccc 300
ttagatcctt ttctacccac tttcctatgg aggattccaa gtcaccactt ctctcaccgg 360
cttctaccag ggtccaggac taaggcgttt tctccatagc ctcaacattt tgggaatctt 420
cccttaatca cccttgctcc tcctgggtgc ctggaagatg gactggcaga gacctctttg 480
ttgcgttttg tgctttgatg ccaggaatgc cgcctagtt 519
<210> 259
<211> 371 
<212> DNA 
<213> Homo sapiens 



<400> 259 

attgtcaact atatacacag tagtgaggaa 
tgaaaatgtc ttctaaatat gaccagtcta 
tcttccagct ccatgtcatc taacccactt 
cgtcttaaca actccatttc caaaagtcat 
ttttaaacaa atgagaattt acaagatgtg 
ggcaacctct ttccatctag aagggctaga 
ggtggagttg a 

<210> 260 

<211> 430 

<212> DNA 

<213> Homo sapiens 



taaaatgcac acaaaacaat ggatagaata 60 

gcatagaacc ttcttctctt ccttctcagg 120 

aacaaacgtg gacgtatcgc ttccagaggc 180 

ctccagaaga catgtatttt ctatgatttc 240 

taactttcta actctatttt atcatacgtc 300 

tgtgacaaat gttttctatt aaaaggttgg 360 

371 



<220> 

<221> misc_feature 

<222> 57, 189, 208, 256, 426 

<223> n = A,T,C or G 



<400> 260 

ttggattttt tgacttgcga tttcagtttt 
tactatattt attgtcaaag agtggtacat 
gtatactctg cttcgctgtt tcagtaaaag 
caccataana caagcgcaag tcactcanaa 
ttgaggaaag actganaaag cctatcccaa 
ctacatgttc ccgtttttct gccaatctac 
gagtcacttc ctgctatgtg aacaaaaacc 
tctcgngagc 

<210> 261 

<211> 365 

<212> DNA 

<213> Homo sapiens 



tttacttttt tttttttttt ttttganaaa 60 
aggtgagtgt tcatcttccc tctcatgccg 120 
ttttccgtag ttctgaacgt cccttgacca 180 
ttgccactgg aaaactggct caactatcat 240 
agtaatggac atgcaccaac atcgcggtac 300 
ctgtgtttcc aagataaatt accacccagg 360 
cggtttcttt ctggaggtgc ttgactactc 420 

430 



<220> 

<221> misc_feature 
<222> 178 

<223> n = A,T,C or G 



<400> 261 

tcctgacgat agccatggct gtaccactta 
atatcacaaa atgacttgta cacagtagtt 
aaaaaagacg cctcaaaatt cactcaactt 
aagctatgct gcaactgagg gcacatatca 
gctggaaaaa atctcatact aagcaaacag 
tatctgctca gcctgccgct aacagatctc 
ccaaa 



actatgattc tattccaact gttcagaatc 60 
tacaacgact cccaagagag gaaaaaaaaa 120 
ttgagacagc aatggcaata ggcagcanag 180 
ttgaagatgt cacaggagtt taagagacag 240 
tagtatctca taccaagcaa aaccaagtag 300 
acaatcacca actgtgcttt aggactgtca 360 

365 



<210> 262 
<211> 500 
<212> DNA 
<213> Homo sapiens 
<400> 262

cctagatgtc atttgggacc cttcacaacc attttgaagc cctgtttgag tccctgggat 60 

atgtgagctg tttctatgca taatggatat tcggggttaa caacagtccc ctgcttggct 120 

tctattctga atccttttct ttcaccatgg ggtgcctgaa gggtggctga tgcatatggt 180 

acaatggcac ccagtgtaaa gcagctacaa ttaggagtgg atgtgttctg tagcatccta 240 

tttaaataag cctattttat cctttggccc gtcaactctg ttatctgctg cttgtactgg 300 

tgcctgtact tttctgactc tcattgacca tattccacga ccatggttgt catccattac 360 

ttgatcctac tttacatgtc tagtctgtgt ggttggtggt gaataggctt ctttttacat 420 

ggtgctgcca gcccagctaa ttaatggtgc acgtggactt ttagcaagcg ggctcactgg 480 

aagagactga acctggcatg 500 

<210> 263 
<211> 413 
<212> DNA 
<213> Homo sapiens 

<400> 263 

ctcagagagg ttgaaagatt tgcctacgaa agggacagtg atgaagctaa gctctagatc 60 

caggatgtct gacttcaaat tgaaactccc aaagtaatga gtttggaagg gtggggtgtg 120 

gcctttccag gatgggggtc ttttctgctc ccagcggata gtgaaacccc tgtctgcacc 180 

tggttgggcg tgttgctttc ccaaaggttt tttttttagg tccgtcgctg tcttgtggat 240 

taggcattat tatctttact ttgtctccaa ataacctgga gaatggagag agtagtgacc 300 

agctcagggc cacagtgcga tgaggaccat cttctcacct ctctaaatgc aggaagaaac 360 

gcagagtaac gtggaagtgg tccacaccta ccgccagcac attgtgaatg aca 413 

<210> 264 
<211> 524 
<212> DNA 
<213> Homo sapiens 

<400> 264 

tccaatgggg ccctgagagc tgtgacagga actcacactc tggcactggc agcaaaacac 60 
cattccaccc cactcatcgt ctgtgcacct atgttcaaac tttctccaca gttccccaat 120 
gaagaagact catttcataa gtttgtggct cctgaagaag tcctgccatt cacagaaggg 180 
gacattctgg agaaggtcag cgtgcattgc cctgtgtttg actacgttcc cccagagctc 240 
attaccctct ttatctccaa cattggtggg aatgcacctt cctacatcta ccgcctgatg 300 
agtgaactct accatcctga tgatcatgtt ttatgaccga ccacacgtgt cctaagcaga 360 
ttgcttaggc agatacagaa tgaagaggag acttgagtgt tgctgctgaa gcacatcctt 420 
gcaatgtggg agtgcacagg agtccaccta aaaaaaaaaa tccttgatac tgttgcctgc 480 
ctttttagtc accccgtaac aagggcacac atccaggact gtgt 524 

<210> 265 
<211> 344 
<212> DNA 
<213> Homo sapiens 

<400> 265 

tcctttcttc tacttcagga gatgattcaa agttacttgt ggacatttct ttaagttctg 60 
aagacaaatg agacaggatt tggcctgcgg gttcttcaga cttctctacc acctccatta 120 
actcttcatc ttggcttgac gtaggcaatg cactattttg ctcttttgtt tctggagatg 180 
acccagcacc acttctttct cttggcgggg ttctaagtgt gtctttgaat accagtgaag 240 
actcaggcct atcctgtact ggaaagggac taaatttgtc tttctgtcta ggaggtgatg 300 
cagtagcatc ctcctgaggg ggtaaggcca ttttctcttt ttga 344 

<210> 266 
<211> 210 
<212> DNA 
<213> Homo sapiens 
<220>

<221> misc_feature
<222> 78 

<223> n = A,T,C or G 



<400> 266 

ccacaatgtc cataacttga gcaggctttg gcatcccacc acccccttca gaccaataca 60 

cactatgttg gaggaacnac tttaaaatgt aaaatgagaa atgggcactg aacactccat 120 

cctcactccc aacagcccac ccacacacct cttcaactgc tatccaaaca tggaggagct 180 
cttgtggaag agaggctcaa caccaaataa 210 



<210> 267 
<211> 238 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> 5, 19, 31 
<223> n = A,T,C or G 



<400> 267 

tcggncctcc caccctctna 
caaccccagg catgtaccct 
acagtggctg ttctcgctgg 
tggaactggc ccctcagcca 

<210> 268 
<211> 461 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 459 

<223> n = A,T,C or G 



ctgaaattct ntgaaattct 
cccaacctgg gacccgacct 
gcaggcgtcc caaagcacat 
tcagtggagg tggcctggga 



cccctttggg atgaggatgg 60 
aataccctaa catcctgctg 120 
cgagccagat tcaggcagag 180 
ggctctaccc tgaacggg 238 



<400> 268 

tcctcaagga catgcccctt gatagaaact 
tgatccccca aatgcagggc ctgggactat 
acaagatgca cagcaaataa gtgctgaata 
ccaaacattc accaagtcct cagcaaagag 
ctgagctccc cagtctatac cccaagatat 
tctccaagcc agtttcccct ttctaagtat 
cttttcctac ctctccccag tgactaggtt 
cagcagaagc tccataaatg tgtgctgacc 

<210> 269 
<211> 434 
<212> DNA 

<213> Homo sapiens 



cagttcctgt ctccagttcc ctcctggacc 60 
atccagttcc ttattttcag aggcccatgc 120 
aagacccagc tactgctagc ttaccctgct 180 
ggccatccat tcacctcttc taaaaacaca 240 
gcttggctcc caactatccc tcctctctca 300 
actgatatta ccaaagacac tgacaatctt 360 
tgcagcagga gctctataag tcctagtata 420 
taacattang c 461 



<400> 269 

ctgtgttggt gagcaccgat 
cccgctccca accagaatga 
ccacaatctc cattttgatt 
cttcagtaag gtccatatca 



tcccactcaa tatggcgtgg 
ggaatgatca cttcatctgt 
gagtcatggg atgaaagatt 
cgagcctttc gaagcaatcg 



cttacagtct tcattaggtt 60 
caaggcatgc agtgcatggt 120 
ccacagggtt ccggtaataa 180 
cacaagggca ggcacaccat 240 
cacagttttt tatggcaatc ttgttatcct
ctccacaggc tccaaggtgc acttcctttt 
ggatgccctt gagcttccgc acgtcagtct 
gcaggtatgc aaga 

<210> 270 
<211> 156 
<212> DNA 
<213> Homo sapiens 



ggtcacgtcc aaaagagata ttcttgagag 300 
tgggatggtc taacaatccc accagtactg 360 
tcaccttgtc attgcggtag cataagtgtt 420 

434 



<400> 270 

ctgcaccagc gattaccagt ggcattcaaa tactgtgtga ctaaggattt tgtatgctcc 60 
ccagtagaac cagaatcaga caggtatgag ctagtcaaca gcaagtcttt gttggattcg 120
agtaggctca ggatctgctg aaggtcggag gagtta 156

<210> 271 
<211> 533 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 100, 137, 383, 385, 411 

<223> n = A,T,C or G 



<400> 271 

ccactgtcac ggtctgtctg acacttactg 
tgaagaactt agaagctgtg gagaccttgg 
ctggaactct gactcanaac cggatgacag 
atgaagctga tacgacagag aatcagagtg 
ggcttgctct gtccagaatt gcaggtcttt 
aaaacctacc tattcttaag cgggcagttg 
agtgcataga gctgtgctgt ggntncgtga 
tcgagatacc cttcaactcc accaacaagt 
catcggagcc ccaacacctg ttggtgatga 

<210> 272 
<211> 630 
<212> DNA 
<213> Homo sapiens 



ccaaacgcat ggcaaggaaa aactgcttag 60 

ggtccacgtn caccatctgc tctgataaaa 120 

tggcccacat gtggtttgac aatcaaatcc 180 

gtgtctcttt tgacaagact tcagctacct 240 

gtaacagggc agtgtttcag gctaaccagg 300 

caggagatgc ctctgagtca gcactcttaa 360 
aggagatgag agaaagatac nccaaaatcg 420 

accagttgtc tattcataag aaccccaaca 480 

agggcgcccc agaaaggatc cta 533



<400> 272 

tggtattttt ctttttcttt tggatgtttt 
ttttcttcgc cttcccgtac ttctgtcttc 
caaattgttt catcctacca ctcccaatta 
tgcgttaact aggctttaaa tgacgcaatt 
aatcaccttc ggtttaatct ctttttaaaa 
aacttttaaa ctaaacttta agctgtttaa 
ccttctattg gtattaattc ggggctctgt 
attttttact ccatgaagaa gcttcatctc 
cttttccttc ctcatgctac tcttctaagt 
ttaaaattac gttaaaaact taacgctaag 
tttaagtagt tgtattaatc tctatctttc 



atactttttt ttcttttttc ttctctattc 60 
cagttttcca cttcaaactt ctatcttctc 120 
atctttccat tttcgtctgc gtttagtaaa 180 
ctccctgcgt catggatttc aaggtctttt 240 
gatcgccttc aaattatttt aatcacctac 300 
gtcaccttca ttttaatcta aaagcattgc 360 
agtcctttct ctcaattttc ttttaaatac 420 
aacctccgtc atgttttaga aaccttttat 480 
cttcatattt tctcttaaaa tcttaagcta 540 
caatatctta gtaacctatt gactatattt 600 

630 



<210> 273 
<211> 400 
<212> DNA 
<213> Homo sapiens 
<400> 273

tctggtttgc cctccagttc attctgaatc tagacttgct cagcctaatc aagttcctgt 60 

acaaccagaa gcgacacagg ttcctttggt atcatccaca agtgaggggt acacagcatc 120 

tcaacccttg taccagcctt ctcatgctac agagcaacga ccacagaagg aaccaattga 180 

tcagattcag gcaacaatct ctttaaatac agaccagact acagcatcat catcccttcc 240 

tgctgcgtct cagcctcaag tatttcaggc tgggacaagc aaacctttac atagcagtgg 300 

aatcaatgta aatgcagctc cattccaatc catgcaaacg gtgttcaata tgaatgcccc 360 
agttcctcct gttaatgaac cagaaacttt aaaacagcaa 400 

<210> 274 

<211> 351 

<212> DNA 

<213> Homo sapiens

<220> 

<221> misc_feature 
<222> 2 

<223> n = A,T,C or G
<400> 274 

tntgagtatg tcccagagaa ggtgaagaaa gcggaaaaga aattagaaga gaatccatat 60 
gaccttgatg cttggagcat tctcattcga gaggcacaga atcaacctat agacaaagca 120 
cggaagactt atgaacgcct tgttgcccag ttccccagtt ctggcagatt ctggaaactg 180 
tacattgaag cagaggttac tattttattt tattttttct tatatcagta ttgcagcatt 240 
cactgtagtg atagaaaaca agttaggaac atagccaatt aggacaagga ggatttaaat 300 
gtgtcttacc tttattttgt aaaataggta taaaggagta attaaaatga a 351 

<210> 275 
<211> 381 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 4, 11, 12, 13 
<223> n = A,T,C or G 

<400> 275 

gcgnggtcgc nnncgaggtc tgagaagccc ataccactat ttgttgagaa atgtgtggaa 60 

tttattgaag atacagggtt atgtaccgaa ggactctacc gtgtcagcgg gaataaaact 120 

gaccaagaca atattcaaaa gcagtttgat caagatcata atatcaatct agtgtcaatg 180 

gaagtaacag taaatgctgt agctggagcc cttaaagctt tctttgcaga tctgccagat 240 

cctttaattc catattctct tcatccagaa ctattggaag cagcaaaaat cccggataaa 300 

acagaacgtc ttcatgcctt gaaagaaatt gttaagaaat ttcatcctgt aaactatgat 360 

gtattcagat acgtgataac a 381

<210> 276 
<211> 390 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 5 

<223> n = A,T,C or G 
<400> 276 
gctcngactc cggcgggacc tgctcggagg
cctgttggcc ctgacaatca tagccagcac 
caagcatgac gtggagagac taaaagcctc 
tgccttctac tccatcgtgg gactcagcag 
agcatgtacc tacatcagat ctaaccttga 
tgcccaggcc agccaggccc tctcaggatg 
tctgcttctg gcagacctcg gccgcgacca 

<210> 277 

<211> 378 

<212> DNA 

<213> Homo sapiens 



aatggcgccg ccgggttcaa gcactgtctt 60 
ctgggctctg acgcccactc actacctcac 120 
gctggatcgc cctttcacaa atttggaatc 180 
ccttggtgct caggtgccag atgcaaagaa 240 
tcccagcaat gtggattccc tcttctacgc 300 
tgagatctct atttcaaatg agaccaaaga 360 

390 



<400> 277 

tgggaacttc tggggtagga cgttgtctgc 
cgatggtttt ggaccattta tgccgggatt 
actggagcgt gctcttcagg atccaaatgt 
tgaagcaggc gttgttgttc cggatccagg 
caggcaccag gttctcttta ttgctgatga 
atggctggct gttgattatg aaaatgtcag 
ttctgggggc ttataccc 

<210> 278 
<211> 366 
<212> DNA 
<213> Homo sapiens 



tatctccagt tccacagacc caaccagtta 60 
cgacatcatt ccctataatg atctgcccgc 120 
ggctgcgttc atggtagaac caattcaggg 180 
ttacctaatg ggagtgcgag agctctgcac 240 
aatacagaca ggattggcca gaactggtag 300 
acctgatata gtcctccttg gaaaggccct 360 

378 



<400> 278 

ggagggcaca ttccttttca cctcagagtc 
tgaccaaacc agtgatgctg tccttgatgc 
agcttgtgaa actgttgcta aaactggaat 
agctgctgtt gactaccaga aagtggttcg 
ttcttccaaa ggttttgact acaagacttg 
accagatatt gctcaaggtg ttcatcttga 
ccaggg 

<210> 279 
<211> 435 
<212> DNA 
<213> Homo sapiens 



ggtcggggaa ggccacccag ataagatttg 60 
ccaccttcag caggatcctg atgccaaagt 120 
gatccttctt gctggggaaa ttacatccag 180 
tgaagctgtt aaacacattg gatatgatga 240 
taacgtgctg gtagccttgg agcaacagtc 300 
cagaaatgaa gaagacattg gtgctggaga 360 



<400> 279 

cctaagaact gagacttgtg acacaaggcc 
tggaagacct acaacccaag gatggaaggc 
aggacccaag cgaaaaagat atctcaagac 
agaacccagg aaggatagaa gcttgaagac 
ccctacctct tttctattgt ttacacttct 
atctttaagc ctgattcttt tgagatgtac 
acaagtatta tgcctggcca gtcttgagcc 
taggctatag tgttt 

<210> 280 
<211> 435 
<212> DNA 
<213> Homo sapiens 



aacgacctaa gattagccca gggttgtagc 60 
ccctgtcaca aagcctacct agatggatag 120 
taacggccgg aatctggagg cccatgaccc 180 
ctggggaaat cccaagatga gaaccctaaa 240 
tactcttaga tatttccagt tctcctgttt 300 
tttttgatgt tgccggttac ctttagattg 360 
agctttaaat cacagctttt acctatttgt 420 

435 



<400> 280 

tctggatgag ctgctaactg agcacaggat gacctgggac ccagcccagc caccccgaga 60
cctgactgag gccttcctgg caaagaagga

caatgatgag aacctgcgca tagtggtggg 

ctcgaccacg ctggcctggg gcctcctgct 

gcccagacct gtccgggcgg ccgctcgaaa 

atccgagctc ggtaccaagc ttggcgtaat 

gttatccgct cacaattcca cacaacatac 

gtgcctaatg agtga 

<210> 281
<211> 440 
<212> DNA 
<213> Homo sapiens 



gaaggccaag gggagccctg agagcagctt 120 

taacctgttc cttgccggga tggtgaccac 180 

catgatccta cacctggatg tgcagcgtga 240 

ttccagcaca ctggcggccg ttactagtgg 300 

catggtcata gctgtttcct gtgtgaaatt 360 

gagccggaag cataaagtgt aaagcctggg 420 

435 



<400> 281 

catctgatct ataaatgcgg tggcatcgac 
gctgctgaga tgggaaaggg ctccttcaag 
gagcgtgaac gtggtatcac cattgatatc 
tatgtgacta tcattgatgc cccaggacac 
acatctcagg ctgactgtgc tgtcctgatt 
ggtatctcca agaatgggca gacccgagag 
aaacaactaa ttgtcggtgt taacaaaatg 
gatatgagga aattgttaag 



aaaagaacca ttgaaaaatt tgagaaggag 60 

tatgcctggg tcttggataa actgaaagct 120 

tccttgtgga aatttgagac cagcaagtac 180 

agagacttta tcaaaaacat gattacaggg 240 

gttgctgctg gtgttggtga atttgaagct 300 

catgcccttc tggcttacac actgggtgtg 360 

gattccactg agccccctac agccagaaga 420 

440 



<210> 282 
<211> 502 
<212> DNA 
<213> Homo sapiens 



<400> 282 

tctgtggcgc aggagccccc tcccccggca gctctgacgt ctccaccgca gggactggtg 60 

cttctcggag ctcccactcc tcagactccg gtggaagtga cgtggacctg gatcccactg 120 

atggcaagct cttccccagc gatggttttc gtgactgcaa gaagggggat cccaagcacg 180 

ggaagcggaa acgaggccgg ccccgaaagc tgagcaaaga gtactgggac tgtctcgagg 240 

gcaagaagag caagcacgcg cccagaggca cccacctgtg ggagttcatc cgggacatcc 300 

tcatccaccc ggagctcaac gagggcctca tgaagtggga gaatcggcat gaaggcgtct 360 

tcaagttcct gcgctccgag gctgtggccc aactatgggg ccaaaagaaa aagaacagca 420 

acatgaccta cgagaagctg agccgggcca tgaggtacta ctacaaacgg gagatcctgg 480 

aacgggtgga tggccggcga ct 502 

<210> 283 
<211> 433 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature

<222> 130, 147, 221, 225, 242, 246, 261, 279, 292, 294, 298, 314, 
323, 332, 339, 342, 343, 350, 351, 356, 361, 362, 368, 372, 
375, 379, 380, 382, 387, 390, 392, 394, 401, 404, 406, 409, 
413, 423, 431, 433 
<223> n = A,T,C or G



<400> 283 

ccatattaga ttactggaac atctaagcat 
ggggagtgtc tatttttaaa aaggtttatg 
cagaatcaan cccactttta ggcttangac 
ataacaaaaa gtgttctaaa tgtggctatt 
antgtntata cagaaagagt ntaaaagttc 



cagtgtgtga ccatgcgaac aaaagacttc 60 
tgtgtcgagg cagttgtaaa agatttactg 120 
caggttctaa ctatctaaaa atattgactg 180 
ctgatccata nttgnttttt aaagaaaaaa 240 
tgtgaattna atgcaaatta gncnccantc 300 
ttgacttccc aaanacttga ttnatacctt
nntcaatnat tnggnagtnn anggccntcn 
canccgcctt nan 

<210> 284 
<211> 479
<212> DNA 

<213> Homo sapiens 



tnactcctnt cnnttcctgn ncttcnttaa 360 
gnanaacacc nttncncgnt ccncgcaatc 420 

433 



<400> 284 

tctggaagga tcagggatct gagcaaagcc 
gggtcaagca gtttgttttc taataagcat 
tgctcctctt ggaatgatac aggggatctg 
tagcagcaat gacagaatga cagcgactct 
tcactatgtg aataaaggca gctagaaaat 
agcccatatt aagacttcta gaacaagtta 
gggaaaaggg ctttagtata gtttaggatg 
atgcatagtg ggggaataaa gcctcagagt 

<210> 285 
<211> 435 
<212> DNA 

<213> Homo sapiens 



aagtttactt aagctaagcc acttgttcct 60 

cattcctgat cattagagca aagggatgaa 120 

ccactgggag agtgttgctc agtgttagag 180 

ctgagtcaac ccagtacttt tagtaccccg 240 

ggactcaatt ctgcaagcct tcatggcaac 300 

aaaaaaaatc ttccatttcc atccatgcat 360 

gatgtgtgta taataataaa atgataagat 420 

ccttccagta tggggaatcc attgtatct 479 



<220> 

<221> misc_feature 

<222> 27, 83, 90, 93, 96, 184, 207, 227, 232, 293, 306, 307, 328, 

331, 339, 343, 347, 349, 350, 370, 371, 382, 383, 414, 418, 

434 

<223> n = A,T,C or G 



<400> 285 

tttttttttt tttttttttt tcaatanaaa 
tcatccttat gtaacaaaat gtnttcttan 
taatcagcaa acatacaact gttggcaact 
cagngctgaa aacaaacctg cttaaanata 
caaaaattga ggtattttgg ttcttctagg 
cccctnnccc aaaaaataaa ataagggnat 
cggggaaaan naaaccttgg gnngggggtt 
tcccaaaaaa aacng 

<210> 286 
<211> 301 
<212> DNA 

<213> Homo sapiens 



tgccataatt tattccattg tataaaaaag 60 
aanaanaaat atattatttc aggtcataaa 120 
aaaaaaaaac ccaacactgg tattttccat 180 
tatttacagg gatagtncag tnctcaaaaa 240 
agtagacaat gacattttgg gangggcaga 300 
nttcttcant atngaanann gggggcgccc 360 
tggcccaagc ccttgaaaaa aaantttntt 420 

435 



<400> 286 

cctggtttct ggtggcctct atgaatccca 
ctgtgagcac cacgtcaacg gctcccggcc 
gtgtagcaag atctgtgagc ctggctacag 
atacaattcc tacagcgtct ccaatagcga 
cggccccgtg gagggagctt tctctgtgta 

g 

<210> 287 
<211> 432 
<212> DNA 
<213> Homo sapiens 



tgtagggtgc agaccgtact ccatccctcc 60 
cccatgcacg ggggagggag atacccccaa 120 
cccgacctac aaacaggaca agcactacgg 180 
gaaggacatc atggccgaga tctacaaaaa 240 
ttcggacttc ctgctctaca agtcaggagt 300 

301 
<400> 287

tccagcttgt tgccagcatg agaaccgcca 
atgacttcag agttgccagc caagtcagcg 
acggcaccag ctttgcagac ggcaagggac 
gatttatttt ctgttccatc catctcgatc 
acgttcagtt tcttgctaac cagggcaggc 
tttgagacac ccttccccat atagcgagtc 
tagataccag ttgaagcacc actgggcaca 
agatcaacct ca 

<210> 288
<211> 326 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 254 

<223> n = A,T,C or G 



ttgatgacat tgaacgccgg gactggcagg 60 
atgtggcggt acagggggac ccccttctca 120 
acccccagaa tggcgttcgc accaaactta 180 
atcagtttgt caatcttctc ttgttctgtg 240 
gcaatagttt tattgatgtg ctcaacagcc 300 
ttatcattgt cccggagctc tagggcctca 360 
gcagctctga agagaccttt tgaggtgaag 420 

432 



<400> 288 

tctggctcaa gtcaaagtcc 
cgttgtcccg ggtgtcatcc 
gaagaaacag caagagcagc 
ccagaatggc aggaatttgc 
cggccccttc acantcacac 
tatccatgag cttgagattg 



tggtcctctt ctccgcctcc
tctgggggca gtaagggctc 
agaatcagaa ttagcaaagc
aatcctgctt cgacaggctg 
acgctgacct ctaaggtggt 
attttg 



ttcttcatca tagtaataaa 60 
tttgaccacc gctctcctcc 120 
aagaattcct ccaagaatcc 180 
tgccttccta cagacgccgg 240 
cacttggtct ttattctggt 300 

326 



<210> 289 
<211> 451 
<212> DNA 
<213> Homo sapiens 

<400> 289 

gtcccggtgt ggctgtgccg ttggtcctgt 
cccagaccca agaccaaccg atggaggagg 
aaattgccca gttgatgtca ttgatcatca 
tgagagagct catttcaaat tcatcagatg 
cagatcccag taaattagac tctgggaaag 
aagatcgaac tctcactatt gtggatactg 
ataaccttgg tactatcgcc aagtctggga
gtgcagatat ctctatgatt ggacctcggc 

<210> 290 
<211> 494 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature

<222> 421 

<223> n = A,T,C or G 



gcggtcactt agccaagatg cctgaggaaa 60 

aggaggttga gacgttcgcc tttcaggcag 120 

atactttcta ctcgaacaaa gagatctttc 180 

cattggacaa aatccggtat gaaagcttga 240 

agctgcatat taaccttata ccgaacaaac 300 

gaattggaat gaccaaggct gacttgatca 360 

ccaaagcgtt catggaagct ttgcaggctg 420 

c 451 



<400> 290 

tttttttttt tcaaaacagt atattttatt 
tcaattgtga catctagatg gcttaagatt 
atttttcaat cttccaaaca gcaaagactc 



ttacaatagc aaccaactcc ccagtttgtt 60 
actttctggt ggtcacccat gctgaacaat 120 
aaaagagatt ctgcatttca catcagttca 180 
caagttcaag agtcttccat ttatcttagc
gacaatgacg aagccactta attccttgtg 
cttcatttat gtgaataaag cagatgatga 
tagtggtagt gatgccagaa acttgtaaat 
ngtggttccc caaggctggt accaatgaga 
cctcctgctg ccca 

<210> 291 
<211> 535 
<212> DNA 
<213> Homo sapiens 



ttttggaata aattatcttt gaggtagaag 240 
tctgcataaa agcagattta ttcatcacaa 300 
taaaatgttc tcttattctt gtttaatcag 360 
gcacttcaaa ccaattgtgg ctcaagtgta 420 
ctggggtttg ggaattagtt ggtcatcatc 480 

494 



<400> 291 

tcgcgtgctt aacatgaaaa caaactttgt 
gtcttgtctc tcatcatggg gtgtctgacc 
catgcaataa tcttccaaaa tgtccaatac 
tgaaccttgt tcttttaact tagggagcag 
ggttactaga tgcatatctg ttgtataagg 
tggatgatta caacattttc tcaactgcat 
cttgcctgct gagttgagta tatctatatc 
ttgcattttg ctgaggccca catagatttt 
aacatcagcc ttaattcgac gaaggaggaa 

<210> 292 
<211> 376 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 4, 348 

<223> n = A,T,C or G 



gctgtttggt tcattgtatg cattgatgga 60 
atccaacctg cagtactcat aatttctcca 120 
ccttgtcatt tgactgaaga ttagtactcg 180 
cttgtctaaa accaccattt tgccactgtt 240 
tggaccaggt tctgctccat caaagagata 300 
taggatgttc aataacctca ttttgtccat 360 
cttcattaat atccgagtat accattccct 420 
tacttccttc tttggaggca aactcttttc 480 
tggacgcaaa accatatgaa gcctc 535 



<400> 292 

tacnagcccg tgctgatcga gatcctggtg 
aaaattggag cctgcccctc ggcccataag 
ggcccaagct actggtgcca gaacacagag 
tgcaaacgcc atgtgtggaa ctaggaggag 
ttggtttttt tctacttgtg tgtctggggg 
ataaaaatag ggctccccca cctcccccat 
tgcaagggag ccccta 

<210> 293 
<211> 320 
<212> DNA 
<213> Homo sapiens 



gaggtgatgg atccttcctt cgtgtgcttg 60 
cccttgttgg gaactgagaa gtgtatatgg 120 
acagcagccc agtgcaatgc tgtcgagcat 180 
gaatattcca tcttggcaga aaccacagca 240 
aatgaacgca cagatctgtt tgactttgtt 300 
ttttgtgtcc tttattgnag cattgctgtc 360 

376 



<400> 293 

tcggctgctt cctggtctgg cggggatggg 
cctcgcatgg cctgcagtct ggcagcagcc 
tcctccaggt agagttttct ttgcttatgt 
aagtgatgtt ggaatcgttt cttttgtttg 
aaagtttttt attagcattc tgaaagaagg 
ggccttcccc tttagaatag 

<210> 294 
<211> 359 
<212> DNA 



tttgctttgg aaatcctcta ggaggctcct 60 
ccgagttgtt tcctcgctga tcgatttctt 120 
tgaattccat tgcctctttt ctcatcacag 180 
tctgatttat ggttttttta agtataaaca 240 
aaagtaaaat gtacaagttt aataaaaagg 300 

320 
<213> Homo sapiens



<400> 294 

ctgtcataaa ctggtctgga gtttctgacg 
gagacttgct ggcctctccg ttgagtccac 
cactgttgat cactagcttt ttcttctgcc 
aaactgcaag aatcaaagcc aaggccaaga 
tttggggtgt ccttatagga ccagaggttg 
agtgtccatc tgattcagat ccatgagtgg 

<210> 295 
<211> 584 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 558 

<223> n = A,T,C or G 



actccttgtt caccaaatgc accatttcct 60 
ttggctttct gtcctccaca gctccattgc 120 
cacaccttct tcgactgttg actgcaatgc 180 
gggatgccaa gatgatcagc cattctggaa 240 
tgtttgctcc accttcttga ctcccatgtg 300 
tatgggaccc cccactgggg tggaatgtg 359 



<400> 295 

cctgagttgg gctgactgcc agagacagac 
tttacctcag tggttggcac ctggaacctg 
cgggcagtga agtaattgtc caggtctatg 
gtattcctca ggttgtggaa ctgggtctga 
tcccggagtg ggggaagctg tgaggtcagg 
aaactcagca gaatggtgaa ctggagaagt 
attgctgaag gaccagaatg tttatgcttt 
caaggccact gctctgccgc tccagccagc 
ccccaccctg gcagaacaca agggatgagc 
tgtggagcca aggccaanga cacactccag 

<210> 296 
<211> 287 
<212> DNA 
<213> Homo sapiens 



ccctctgggt ctcggtgaac cagccaggca 60 
tccagggccc tcacctgact gaggagccgc 120 
ctcttggggt ggataccata gccatccaag 180 
gtataggcag aactgggccc caggatgatc 240 
taagtatcca cgtccacccg taccccaatc 300 
ccttccgtta agtatttctt cagagaaagc 360 
ttggttttta aaatcttcca aaagacaaat 420 
aggttaccct cctcagtgtc aaaccccgta 480 
tccctgacgg ccccagagga aagcacaccc 540 
accacattca cttt 584 



<400> 296 

ccttatcatt cattcttagc tcttaattgt 
tttaaccaaa acatgtctcc tatcctggtt 
caagatttta aagacatgta ggtgtttgtt 
ttcagtccta agaaagagga gtgcttgtcc 
gtctgaagga cacttcctgc ctaagggaga 

<210> 297 
<211> 457 
<212> DNA 

<213> Homo sapiens 



tcattttgag ctgaaatgct gcattttaat 60 
tttgtagcct tcctccacat cctttctaaa 120 
catctgtaac tctaaaagat cctttttaaa 180 
cctaagagtg tttaatggca aggcagccct 240 
gtggtatttg cagacta 287 



<400> 297 

ccaattgaaa caaacagttc tgagaccgtt 
ggtattaggg ataatattca tttagccttc
ccagctccag cagccttctt gtccactgct 
atatcacgaa cagcaaagcg acccaaaggt 
ggcttgccag gaaccatatc aacaatggca 
tcttccagct ttttaccaga acggcgatca 
gcaatgtgag ccgtgtggca atccaataca 
tggttcagga taatcacctg agcagtgaag 



cttccaccac tgattaagag tggggtggca 60 
tgagctttct gggcagactt ggtgaccttg 120 
ttgatgacac ccaccgcaac tgtctgtctc 180 
ggatagtctg agaagctctc aacacacatg 240 
gcatcaccag acttcaagaa tttagggcca 300 
atcttttcct tcagctcagc aaacttgcat 360 
ggggcatagc cggcgcttat ttggcctgga 420 
ccagacc 457 
<210> 298
<211> 469 
<212> DNA 

<213> Homo sapiens 



<400> 298 

tctttgactt tccttgtcta cctcctctgg 

cagagatctc aatgattcct gattctcctc 

ttccacagac tccagtggtt cttgaatttc 

tatttcttct ggagtccaca gtggtgcttg 

ctcttgtccc gcagacttca gtgattctag 

tatgctctct tctttgagtg actttaagaa 

tatctcctgg tcaagagact tcagtggttc 

catctgatcc tgttccccta gagacctccg 



agatctcaaa ttctccaggt tccatgctcc 60 

ttccaggagt ctgaatgtct cttggttcac 120 

cttttctaga ggattcattg ccccctgatt 180 

agtttctgga gatttcagtg tttccaggtt 240 

gatctctgtt tctaaagatt ttactgcctc 300 

ctcttgattc tcattttcaa gaggtctagc 360 

tagatccact ttttctgggg gtcttaatgt 420 

tcgctgttga gtctctttt 469 



<210> 299 
<211> 165 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 37, 82, 144 
<223> n = A,T,C or G



<400> 299 

tctgtggaga ggatgaggtt gagggaggtg gggtatntcg ctgctctgac cttaggtaga 60 
gtcctccaca gaagcatcaa antggactgg cacatatgga ctcccttcac aggccacaat 120 
gatgtgtctc tccttcgggc tggnccggta tgcacagttg gggta 165 



<210> 300 
<211> 506 
<212> DNA 

<213> Homo sapiens 



<400> 300 

tctgaggaaa gtttgggctt attagtattt 
gcggacaacg taactaccag ctccttggct 
gtaggttctg tcattattgt tggcacatag 
tgagcgctcc tccattgtga aaccaaatat 
cacactgagg aagacagaac catttagcac 
tctcacagag taattgacgg agatatatga 
ctcatcagcg gagatgttga agttacctga 
atccaagaag gtctttccca tcagggcagg 
gaatgctcct tgagaaattt ccgtga 

<210> 301 
<211> 304 
<212> DNA 

<213> Homo sapiens 



gctccagcga acctccaagt tttctccatt 60 
cagtggttcg cctccactca gaagttccca 120 
gccctgaata caggtgatat agggccccca 180 
agtatcattc attttctggg ctttctccat 240 
agtgacattg gtgaaatatg tttcattgat 300 
ttgtgagtca ggaggtgtca cagttatagg 360 
agcagagacg caagaagagt ctttgttaat 420 
taagacctgg gctgcagcgt ttggattgct 480 

506 



<220> 

<221> misc_feature 

<222> 221, 223, 252, 275, 280 

<223> n = A,T,C or G 



<400> 301 
tcctaaggca gagcccccat cacctcaggc
tgcccctttc ctctccctca gaatttgtgt 
cttctggggg gggtctagaa cagtgcctgg 
tgttgaatgt ctcctctctc tttccactct 
accctgtatt tntttctggt gcccattcca 
ctcc 

<210> 302 
<211> 492 
<212> DNA 
<213> Homo sapiens 



ttctcagttc ccttagccgt cttactcaac 60 
ttgctgcctc tatcttgttt tttgtttttt 120 
cacatagtag gcgctcaata aatacttgtt 180 
gggaaaccta ngnttctgcc attctgggtg 240 
tttgnccagn taatacttcc tcttaaaaat 300 

304 



<400> 302 

ttttcagtaa gcaacttttc catgctctta 
tattagattg aatggaaaag cacttgccat 
tcctgtatca catacggagg tcttgtgtat 
ctttatttgc tgctgtttaa gttgccaacc 
cctgcctttg tagttctggt attcacttta 
gaatacaagc attgcttttg gcaaattaaa 
gggaaataaa ttaaagtaca caagtccaag 
ttgtgcacat gtgagagggt gtccagtttg 
actattgtgt gt 

<210> 303 
<211> 470 
<212> DNA 
<213> Homo sapiens 



atgtattcct ttttagtagg aatccggaag 60 
ctctgtctag gggtcacaaa ttgaaatggc 120 
ctgtggcaac agggagtttc cttattcact 180 
tcccctccca ataaaaattc acttacacct 240 
ctatgtgata gaagtagcat gttgctgcca 300 
gtgcatgtca tttcttaata cactagaaag 360 
tctaaaactt tagtactttt ccatgcagat 420 
tctagtgatt gttatttaga gagttggacc 480 

492 



<400> 303 

tctggggcag caggtactcc ctacggcact 
gcagcggtgg ctcacatggc ctgtctgcac 
ttggtctcct tggaagacag gtctgatgtt 
aaacttgtat cttacgtgaa cttaaagaat 
aggactggca cgacaggccc acggcagatt 
gaattccagc caccacttct gattcgattc 
agaagccatt gtcaccccag tcagtgttcc 
cattctccac tccccagccc aggatgcgga 

<210> 304 

<211> 79 

<212> DNA 

<213> Homo sapiens 



agtctacagg gggaaggacg ctctgtgctg 60 
tgtaaccaca ggctgggatg tagccaggac 120 
tggccaatcc agtccttcag accctgcctg 180 
aaaatgcatt tctaccccga tctcgccccc 240 
agatcttttc ccagtactga tcggtgcgtg 300 
cacagtgatc ctgtcctctg agtattttaa 360 
aggagttggc aaccagccag tagggtgtgc 420 
tggcatggac ctcggccgcg 470 



<400> 304 

tgtcccattg ttaactcagc ctcaaatctc aactgtcagg ccctacaaag aaaatggaga 60 
gcctcttctg gtggatgcg 79 

<210> 305 
<211> 476 
<212> DNA 
<213> Homo sapiens 



<400> 305

tcactgagcc accctacagc cagaagagat atgaggaaat tgttaaggaa gtcagcactt 60
acattaagaa aattggctac aaccccgaca cagtagcatt tgtgccaatt tctggttgga 120
atggtgacaa catgctggag ccaagtgcta acgtaagtgg ctttcaagac cattgttaaa 180
aagctctggg aatggcgatt tcatgcttac acaaattggc atgcttgtgt ttcagatgcc 240
ttggttcaag ggatggaaag tcacccgtaa ggatggcaat gccagtggaa ccacgctgct 300
tgaggctctg gactgcatcc taccaccaac tcgtccaact gacaagccct tgcgcctgcc 360
tctccaggat gtctacaaaa ttggtggtaa gttggctgta aacaaagttg aatttgagtt 420
gatagagtac tgtctgcctt cataggtatt tagtatgctg taaatatttt taggta 476

<210> 306 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<400> 306 

tctgtctcgg agctcagggc gcagccagca cacacaggag cccacaggac agccacgtct 60 
tcacagaaac tacagaagtc aggacccagg cgaggacctc aggaacaagt gccccctgca 120 
gacagagaga cgcagtagca acagcttctg aacaactaca taataatgcg gggagaatcc 180 
tgaagaccac tgcatcccac aagcactgac aaccacttca ggattttatt tcctccactc 240 
taacccccag atccatttat gagaagtgag tgaggatggc aggggcatgg agggtgaagg 300 
gacagcaagg atggtctgag ggcctggaaa caatagaaaa tcttegtcct ttagcatatc 360 
ctggactaga aaacaagagt tggagaagag gggggttgat acta 404 

<210> 307 
<211> 260 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> 10, 255, 257 
<223> n = A,T,C or G 

<400> 307 

tcctgcctan acatctgtga gggcctcaag ggctgctgcc tcgactttct ccctagctaa 60 
gtccacccgt ccagggacac agccagggca ctgctctgtg ctgacttcca ctgcagccaa 120 
gggtcaaaat gaagcatctg cggaggccag gactccttgg catcggacac agtcagggga 180 
aaagccaccc tgactctgca ggacagaggg tctagggtca tttggcagga gaacactggt 240 
gtgccaaggg aagcnancat 260 

<210> 308 

<211> 449 

<212> DNA 

<213> Homo sapiens 

<400> 308 

tctgtgctcc cgactcctcc atctcaggta ccaccgactg cactgggcgg ggccctctgg 60 

ggggaaaggc tccacggggc agggatacat ctcgaggcca gtcatcctct ggaggcagcc 120 

caatcaggtc aaagattttg cccaactggt cggcttcaga gtttccacag aagagaggct 180 

ttcgacgaaa catctctgca aagatacagc caacactcca catgtccaca ggtgttgcat 240 

atgtggactg cagaagaact tcgggagctc ggtaccagag tgtaacaacc ttgatcgttt 300 

cggctggcaa gcctggtggg ggtgccttgt ccagatatgt ccttaggtcc tggtctacat 360 

gctcaaacac cagggttacc ttgatctccc ggtcagttcg ggatgtggca cagacgtcca 420 
tcagccggac aacattggga tgctcaaaa 449 

<210> 309 
<211> 411 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 384
<223> n = A,T,C or G

<400> 309

ctgtggaaac ctggggtgcc gggtaaatgg agaactccag cttggatttc ttgccataat 60
caactgagag acgttccatg agcagggagg tgaacccaga accagttccc ccaccaaagc 120
tgtggaaaac caagaagccc tgaagaccgg tgcactggtc agccagcttg cgaattcggt 180
ccaacacaag gtcaatgatc tccttgccaa tggtgtagtg ccctcgggca tagttattgg 240
cagcatcttc cttgcctgtg atgagctgct cagggtggaa gagctggcgg taggtgccag 300
tgcgaacttc atcaatgact gtgggttcca agtctacaaa cacagcccgg ggcacgtgct 360
tgccagcgcc cgtctcactt gaanaagggt gtttgaagga agtcatctcc t 411

<210> 310
<211> 320
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 250
<223> n = A,T,C or G

<400> 310

tcctcgtcca gcttgactcg attagtcctc ataaggtaag caaggcagat ggtggctgac 60
cgggaaatgc ctgcctggca gtggacaaac acccttcctc cagcattctt gatggagtct 120
atgaagtcaa tggcctcgtt gaaccaggag ctgatgtctg ccttgtggtt gtcctccaca 180
gggatgctct tgtactggta gtgaccctca aaatggttgg gacaattggc tgagacgttg 240
atcaaggcan ttatgcccaa ggcatccagc atgtccttgc gggaagcgtg atacgcactg 300
cccaggtaca gaaagggcag 320

<210> 311
<211> 539
<212> DNA
<213> Homo sapiens

<400> 311

tctggcccat gaagctgaag ttgggagaga tgatgcttcg cctctgcttc acaaactcaa 60
aggcctcgtc cagcttgact cgattagtcc tcataaggta agcaaggcag atggtggctg 120
accgggaaat gcctgcctgg cagtggacaa acacccttcc tccagcattc ttgatggagt 180
ctatgaagtc aatggcctcg ttgaaccagg agctgatgtc tgccttgtgg ttgtcctcca 240
cagggatgct cttgtactgg tagtgaccct caaaatggtt gggacaattg gctgagacgt 300
tgatcaaggc agttatgccc aaggcatcca gcatgtcctt gcgggaagcg tgatacgcac 360
tgcccaggta cagaaagggc aggatttcca ccgggccacc ctgaaatcca gaaatatcca 420
acattcatca agcttgctca aagccaaggc cagtgcccat acccacaaaa actttctgct 480
ggaaaagtca atttcagata ccgagtgaac tcagttctgt tgctggagga taaataaat 539

<210> 312
<211> 475
<212> DNA
<213> Homo sapiens

<400> 312

tcaaggatct tcctaaagcc accatgtgag aggattcgga cgagagtctg agctgtatgg 60
cagaccatgt cctgctgttc tagggtcatg actgtgtgta ctctaaagtt gccactctca 120
caggggtcag tgatacccac tgaacctggc aggaacagtc ctgcagccag aatctgcaag 180
cagcgcctgt atgcaacgtt tagggccaaa ggctgtctgg tggggttgtt catcacagca 240
taatggccta gtaggtcaag gatccagggt gtgaggggct caaagccagg aaaacgaatc 300
ctcaagtcct tcagtagtct gatgagaact ttaactgtgg actgagaagc attttcctcg 360
aaccagcggg catgtcggat ggctgctaag gcactctgca atactttgat atccaaatgg 420
agttctggat ccagttttcg aagattgggt ggcactgttg taatgagaat cttca 475

<210> 313 
<211> 456 
<212> DNA 
<213> Homo sapiens 

<400> 313 

tccacttaaa gggtgcctct gccaactggt ggaatcatcg ccacttccag caccacgcca 60 

agcctaacat cttccacaag gatcccgatg tgaacatgct gcacgtgttt gttctgggcg 120 

aatggcagcc catcgagtac ggcaagaaga agctgaaata cctgccctac aatcaccagc 180 

acgaatactt cttcctgatt gggccgccgc tgctcatccc catgtatttc cagtaccaga 240 

tcatcatgac catgatcgtc cataagaact gggtggacct ggcctgggcc gtcagctact 300 

acatccggtt cttcatcacc tacatccctt tctacggcat cctgggagcc ctccttttcc 360 

tcaacttcat caggttcctg gagagccact ggtttgtgtg ggtcacacag atgaatcaca 420 

tcgtcatgga gattgaccag gaggacctcg gcccgc 456 

<210> 314 
<211> 477 
<212> DNA 
<213> Homo sapiens 



<400> 314

tgcgtgggct tctggaagcc tggatctgga atcattcacc agattattct ggaaaactat 60
gcgtaccctg gtgttcttct gattggcact gactcccaca cccccaatgg tggcggcctt 120
gggggcatct gcattggagt tgggggtgcc gatgctgtgg atgtcatggc tgggatcccc 180
tgggagctga agtgccccaa ggtgattggc gtgaagctga cgggctctct ctccggttgg 240
tcctcaccca aagatgtgat cctgaaggtg gcaggcatcc tcacggtgaa aggtggcaca 300
ggtgcaatcg tggaatacca cgggcctggt gtagactcca tctcctgcac tggcatggcg 360
acaatctgca acatgggtgc agaaattggg gccaccactt ccgtgttccc ttacaaccac 420
aggatgaaga agtatctgag caagaccggc cgggaagaca ttgccaatct agctgat 477

<210> 315
<211> 241
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 35
<223> n = A,T,C or G



<400> 315 

caggtactgg atgtcaggtc tgcgaaactt cttanatttt gacctcagtc cataaaccac 60 

actatcacct cggccatcat atgtgtctac tgtggggaca actggagtga aaacttcggt 120 

tgctgcaggt ccgtgggaaa atcagtgacc agttcatcag attcatcaga atggtgagac 180 

tcatcagact ggtgagaatc atcagtgtca tctacatcat cagagtcgtt cgagtcaatg 240 

g 241 

<210> 316 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 1, 4, 32, 39, 68, 77, 82, 94, 166, 172, 195, 196 
<223> n = A,T,C or G

<400> 316

nttntgtgat agtgtggttt atggactgag gncaaaatnt aagaagtttc gcagacctga 60
catccaancc tgcccgngcg gncgctcgaa aggncgaatt ctgcagatat ccatcacact 120
ggcggccgct cgagcatgca tctagagggc ccaattcgcc ctatantgag tnatattaca 180
attcactggc cgtcnnttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 240
241



<210> 317 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 15, 25, 135, 154, 193 

<223> n = A,T,C or G 



<400> 317 

aggtaccctg ctcancagcc tgggngcctg ggttgtctcc ttgtccatcc actggtccat 60
tctgctctgc atttttttgt tcctcttttg gaggttccac tttgggtttg ggctttgaaa 120
ttatagggct acaantacct cggccgaaac cacnctaagg gcgaattctg cagatatcca 180
tcacactggc ggncgctcga gcatgcatct agagggccca attcgcccta tagtgagtcg 240
241



<210> 318 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 3, 5, 10, 11, 24, 28, 31, 34, 40, 42, 47, 53, 74, 80, 96, 
101, 127, 129, 136, 138, 205, 241 
<223> n = A,T,C or G 



<400> 318 

cgngnacaan ntacattgat gganggtntg nggntctgan tntttantta cantggagca 60
ttaatatttt cttnaacgtn cctcaccttc ctgaantaaa nactctgggt tgtagcgctc 120
tgtgetnana accacntnaa ctttacatcc ctcttttgga ttaatccact gcgcggccac 180
ctctgccgcg accacgctaa gggcnaattc tgcagatatc catcacactg gcggccgctc 240
241



<210> 319 
<211> 241
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 24, 36, 39 
<223> n = A,T,C or G 



<400> 319 

caggtactga tcggtgcgtg gaantccagc caccanttnt gattcgattc cacagtgatc 60
ctgtcctctg agtattttaa agaagccatt gtcaccccag tcagtgttcc aggagttggc 120
aaccagccag tagggtgtgc cattctccac tccccagccc aggatgcgga tggcatggcc 180
acccatcatc tctccggtga cgtgttggta cctcggccgc gaccacgcta agggcgaatt 240
c 241

<210> 320 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 27, 215, 216, 217, 220, 222, 235 
<223> n = A,T,C or G 

<400> 320 

ggcaggtacc aacagagctt agtaatntct aaaaagaaaa aatgatcttt ttccgacttc 60 

taaacaagtg actatactag cataaatcat tctagtaaaa cagctaaggt atagacattc 120 

taataatttg ggaaaaccta tgattacaag tgaaaactca gaaatgcaaa gatgttggtt 180 

ttttgtttct cagtctgctt tagcttttaa ctctnnnaan cncatgcaca cttgnaactc 240
t 241 

<210> 321 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 2, 25, 26, 228 
<223> n = A,T,C or G 

<400> 321 

angtaccaac agagcttagt aattnntaaa aagaaaaaat gatctttttc cgacttctaa 60 

acaagtgact atactagcat aaatcattct agtaaaacag ctaaggtata gacattctaa 120 

taatttggga aaacctatga ttacaagtga aaactcagaa atgcaaagat gttggttttt 180 

tgtttctcag tctgctttag cttttaactc tggaagcgca tgcacacntg aactctgctc 240 



a 241

<210> 322
<211> 241
<212> DNA
<213> Homo sapiens



<400> 322 

ggtaccaaca gagcttagta atttctaaaa agaaaaaatg atctttttcc gacttctaaa 60 
caagtgacta tactagcata aatcattctt ctagtaaaac agctaaggta tagacattct 120 
aataatttgg gaaaacctat gattacaagt aaaaactcag aaatgcaaag atgttggttt 180
tttgtttctc agtctgcttt agcttttaac tctggaagcg catgcacact gaactctgct 240 



c 



<210> 323 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<400> 323 

cgaggtactg tcgtatcctc agccttgttc tatttcttta ttttagcttt acagagatta 60 

ggtctcaagt tatgagaatc tccatggctt tcaggggcta aacttttctg ccattctttt 120 

gctcttaccg ggctcagaag gacatgtcag gtgggatacg tgtttctctt tcagagctga 180 

agaaagggtc tgagctgcgg aatcagtaga gaaagccttg gtctcagtga ctccttggct 240
241



<210> 324 
<211> 241 
<212> DNA

<213> Homo sapiens 



<400> 324 

aggtactgtc gtatcctcag ccttgttcta tttctttatt ttagctttac agagattagg 60 
tctcaagtta tgagaatctc catggctttc aggggctaaa cttttctgcc attcttttgc 120 
tcttaccggg cLIgaagga catgtcaggt gggatacgtg tttctctttc agagctgaag 180 
aaagggtctg agctgcggaa tcagtagaga aagccttggt ctcagtgact ccttggcttt 240 



<210> 325 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<400> 325 



ggcaggtaca tttgttttgc ccagccatca ctcttttttg tgaggagcct ^aatacattc 60 
ttcctggggt ccagagtccc cattcaaggc agtcaagtta agacactaac ttggcccttt 120 
cctgatggaa atatttcctc catagcagaa gttgtgttct gacaagactg ^^^g^^ttac 180 
atgttgggaa aaaaaaagaa gcattaactt agtagaactg aaccaggagc attaagttct 240 



g 

<210> 326 
<211> 241 
<212> DNA 
<213> Homo sapiens

<400> 326



gcaggtacat ttgttttgcc cagccatcac tcttttttgt gaggagccta ^atacattct 60 
tcctggggtc cagagtcccc attcaaggca gtcaagttaa gacactaact tggccctttc 120 
ctgaSgaaa tatticctcc atagcagaag ttgtgttctg acaagactga gagagttaca 180 
tgttgggaaa aaaaagaagc attaacttag tagaactgat ccaggagcat taagttctga 240 



<210> 327 
<211> 241 
<212> DNA 

<213> Homo sapiens

<400> 327



ggtaccagac caagtgaatg cgacagggaa ttatttcc1.g tgttgataat toatgaagta 60 
gaacagtata atcaaaatca attgtatcat cattagtttt ccactgcctc acactagtga 120 
gctgtgccaa gtagtagtgt gacacctgtg ttgtcatttc ccacatcacg ^aagagcttc 180 
caaggaaagc caaatcccag atgagtctca gagagggatc aatatgtcca tgattatcag 240 



g 

<210> 328 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 6, 19, 66, 232, 240
<223> n = A,T,C or G

<400> 328

ggtacnagac caaatgaang ccacagggaa ttatttcctg tgttgataat tcatgaagta 60
gaacantata atcaaaatca attgtatcat cattagtttt ccactgcctc acactagtga 120
gctgtgccaa gtagtagtgt gacacctgtg ttgtcatttc ccacatcacg taagagcttc 180
caaggaaagc caaatcccag atgagtctca gagagggatc aatatgtcca tnatcatcan 240
g 241

<210> 329
<211> 241
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 33, 61, 220, 228, 229, 240, 241
<223> n = A,T,C or G

<400> 329

ttcaggtcga gttggctgca gatttgtggt gcnttctgag ccgtctgtcc tgcgccaaaa 60
ngcttcaaag tattattaaa aacatatgga tccccatgaa gccctactac accaaagttt 120
accaggagat ttggatagga atggggctga tgggcttcat cgtttataaa atccgggctg 180
ctgataagaa gtaaggcttt gaaagcttca gcgcctgctn ctggtcanna ctaaccatan 240
n 241



<210> 330 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<400> 330 

ttttgtgcag atttgtggtg cgttctgagc cgtctgtcct gcgccaagat gcttcaaagt 60
attattaaaa acatatggat ccccatgaag ccctactaca ccaaagttta ccaggagatt 120
tggataggaa tggggctgat gggcttcatc gtttataaaa tccgggctgc tgataaaaga 180
agtaaggctt tgaaagcttc agcgcctgct cctggtcatc actaaccaga tttacttgga 240
241



<210> 331 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature

<222> 1, 9, 41, 60, 61, 119, 124, 132, 139, 141, 153, 168 
<223> n = A,T,C or G 



<400> 331 

nttttaggna ctttgggctc cagacttcac tggtcttagg nattgaaacc atcacctggn 60
ntgcattcct catgactgag gttaacttaa aacaaaaatg gtaggaaagc tttcctatnc 120
ttcnggtaag anacaaatnt nctttaaaaa aangtggaag gcatgacnta cgtgagaact 180
gcacaaactg gccactgaca aaaatgaccc ccatttgtgt gacttcattg agacacatta 240
c 241



<210> 332
<211> 241
<212> DNA
<213> Homo sapiens

<400> 332

tgtgaggaga gggaacatgc tgagaaactg atgaagctgc agaaccaacg aggtggccga 60
atcttccttc aggatatcaa gaaaccagac tgtgatgact gggagagcgg gctgaatgca 120
atggagtgtg cattacattt ggaaaaaaat gtgaatcagt cactactgga actgcacaaa 180
ctggccactg acaaaaatga cccccatttg tgtgacttca ttgagacaca ttacctgaat 240
g 241

<210> 333
<211> 241
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 44, 52, 60, 98, 104, 108, 124, 126, 190, 198, 206, 214
<223> n = A,T,C or G

<400> 333

caggtacaag cttttttttt tttttttttt tttttttttt ttgnaaatac tntttattgn 60 
aaatattcta tcctaaattc catatagcca attaattntt acanaatntt ttgttaattt 120 
ttgngngtat aaattttaca aaaataaagg gtatgtttgt tgcacacaac ttacaaataa 180 
taataaactn tttattgnaa atattnttta ttgnaaatat tctttatcct aaattccata 240 



<210> 334 
<211> 241
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 10, 16, 22, 24, 49, 158, 159, 237 
<223> n = A,T,C or G 



<400> 334 

tacctgctgn aggggntgaa gncntctctg ctgccccagg catctgcanc ccctgctgct 60
ggttctgccc ctgctgcagc agaggagaag aaagatgaga agaaggagga gtctgaagag 120
tcagatgatg acatgggatt tggccttttt gattaaannc ctgctcccct gcaaataaag 180
cctttttaca caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagcttgtac ctgcccnggc 240
g 241

<210> 335
<211> 241
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 39
<223> n = A,T,C or G

<400> 335

ctatgtgctg ggatgactat ggagacccaa atgtctcana atgtatgtcc cagaaacctg 60
tggctgcttc aaccattgac agttttgctg ctgctggctt ctgcagacag tcaagctgca 120
gctcccccaa aggctgtgct gaaacttgag cccccgtgga tcaacgtgct ccaggaggac 180
tctgtgactc tgacatgcca gggggctcgc agccctgaga gcgactccat tcagtggttc 240
241



<210> 336 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<400> 336 

taccaaccta tgcagccaag caacctcagc agttcccatc aaggccacct ccaccacaac 60
cgaaagtatc atctcaggga aacttaattc ctgcccgtcc tgctcctgca cctcctttat 120
atagttccct cacttgattt ttttaacctt ctttttgcaa atgtcttcag ggaactgagc 180
taatactttt ttttttcttg atgttttctt gaaaagcctt tctgttgcaa ctatgaatga 240
a 241

<210> 337
<211> 241
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 47, 56, 69, 228
<223> n = A,T,C or G

<400> 337

ggtactgtat gtagctgcac tacaacagat tcttaccgtc tccacanagg tcatanattg 60
taaatggtna atactgactt tttttttatt cccttgactc aagacagcta acttcatttt 120
cagaactgtt ttaaaccttt gtgtgctggt ttataaaata atgtgtgtaa tccttgttgc 180
tttcctgata ccagactgtt tcccgtggtt ggttagaata tattttgntt tgatgcttat 240
a 241



<210> 338 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<400> 338 

aggtacaggt gtgcgctgag ccgagtttac acggaaagga taaagcccat ttagtttctt 60
ctcaaatgga gttttccact ttcctttgaa gtagacagca ttcaccagga tcatcctggt 120
atccccatct acagaacctt caggtaacaa gtttgggatt ttgcctttgg tttgagtctt 180
gacccaggaa ttaatctttt ttctagcttc ttctgcacat tctaggaagt ctactgcctg 240
g 241

<210> 339
<211> 241
<212> DNA
<213> Homo sapiens

<400> 339

taccgacggc tcctggaggg agagagtgaa gggacacggg aagaatcaaa gtcgagcatg 60
aaagtgtctg caactccaaa gatcaaggcc ataacccagg agaccatcaa cggaagatta 120
gttctttgtc aagtgaatga aatccaaaag cacgcatgag accaatgaaa gtttccgcct 180
gttgtaaaat ctattttccc ccaaggaaag tccttgcaca gacaccagtg agtgagttct 240
a 241



<210> 340
<211> 241
<212> DNA
<213> Homo sapiens

<400> 340

gtagccctca cacacacatg cccgtaacag gatttatcac aagacacgcc tgcatgtaga 60
ccagacacag ggcgtatgga aagcacgtcc tcaagactgt agtattccag atgagctgca 120
gatgcttacc taccacggcc gtctccacca gaaaaccatc gccaactcct gcgatcagct 180
tgtgacttac aaaccttgtt taaaagctgc ttacatggac ttctgtcctt taaaagcttc 240
c 241



<210> 341 
<211> 241 
<212> DNA 

<213> Homo sapiens 



<400> 341 

gtaccgccta ctttcgtctc atgtctccga acttcttgct gatggccgtt ccaacgttgc 60
tgaaagctgc agttgccttt tgccctgcgt gactcagggt ttcatgtgtt ttcttgtagg 120
cagtggtagt ctgcatgtca tgccagcttt tgctgaagtt ctgttttaat tcattcatca 180
ggttcatgcc gagttttgtt ttatctcaac tagatgcctt tctttcgctg acaaaacttg 240
t 241



<210> 342 
<211> 241 
<212> DNA 

<213> Homo sapiens 



<400> 342 

gtacattggt gctataaata taaatgctac ttatgaagca tgaaattaag cttctttttt 60
cttcaagttt tttctcttgt ctagcaatct gttaggcttc tgaaccaaga ccaaatgttt 120
acgttcctct gctgcatacc aacgttactc caaacaataa aaatctatca tttctgctct 180
gtgctgagga atggaaaatg aaacccccac cccctgaccc ctaggactat acagtggaaa 240
c 241

<210> 343
<211> 241
<212> DNA
<213> Homo sapiens

<400> 343

gtacatgtgg tagcagtaat ttttttgaag caactgcact gacattcatt tgagttttct 60
ctcattatca gattctgttc caaacaagta ttctgtagat ccaaatggat taccagtgtg 120
ctacagactt cttattatag aacagcattc tattctacat caaaaatagt ttgtgtaagt 180
tagttttggt taccatctaa aatattttta aatgttcttt acataaaaat ttatgttgtg 240
t 241



<210> 344 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<400> 344 

ggtacaaaat tgttggaatt tagctaatag aaaaacatag taaatattta caaaaacgtt 60
gataacatta ctcaagtcac acacatataa caatgtagac aggtcttaac aaagtttaca 120
aattgaaatt atggagattt cccaaaatga atctaatagc -bcattgctga gcatggttat 180
caatataaca tttaagatct tggatcaaat gttgtccccg agtcttctgc aatccagtcc 240
241



<210> 345
<211> 241
<212> DNA
<213> Homo sapiens

<400> 345

ggtacgaagc tgagcgcacg ggggttgccc cagcgtggag cctggacctc aaacttcacg 60
gaaaatgctc tctctctttg acaggcttcc agctgtctcc taatttcctg gatgaactct 120
ccccggcgat ttaactgatc ctgaaaagtg gtgagaggac tgaggaagac aaccaggtca 180
gcgttagatc ggcctctgag ggtggtgccc ttgcctgagg agccaccctt taccaccttg 240
g 241

<210> 346
<211> 241
<212> DNA
<213> Homo sapiens

<400> 346

caggtaccac tgagcctgag atggggatga gggcagagag aggggagccc cctcttccac 60
tcagttgttc ctactcagac tgttgcactc taaacctagg gaggttgaag aatgagaccc 120
ttaggtttta acacgaatcc tgacaccacc atctataggg tcccaacttg gttattgtag 180
gcaaccttcc ctctctcctt ggtgaagaac atcccaagcc agaaagaagt taactacagt 240
g 241

<210> 347
<211> 241
<212> DNA
<213> Homo sapiens



<400> 347 

aggtacatct aaaggcatga agcactcaat tgggcaatta acattagtgt ttgttctctg 60 

atggtatctc tgagaatact ggttgtagga ctggccagta gtgccttcgg gactgggttc 120 

acccccaggt ctgcggcagt tgtcacagcg ccagccccgc tggcctccaa agcatgtgca 180 

ggagcaaatg gcaccgagat attccttctg ccactgttct cctacgtggt atgtcttccc 240 
^ 241 

<210> 348 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 2, 18, 29, 35, 56, 57, 64, 76, 77, 85, 102, 103, 104, 189,
232
<223> n = A,T,C or G

<400> 348

angtacttgg caagattnga tgctcttgng ctcantgaca tcattcataa cttgtnngtg 60
tgancagagg aggagnncat catcntgtcc tcattcgtca gnnncctctc ctctctgaat 120
ctcaaacaag ttgataatgg agaaaaattt gaattctcag gattgaggct ggactggttc 180
cgcctacang catacactag cgtggctaag gcccctctgc accctgcatg anaaccctga 240
c 241

<210> 349
<211> 241
<212> DNA
<213> Homo sapiens

<400> 349

gcaggtacca tttgtctgac ctctgtaaaa aatgtgatcc tacagaagtg gagctggata 60
atcagatagt tactgctacc cagagcaata tctgtgatga agacagtgct acagagacct 120
gctacactta tgacagaaac aagtgctaca cagctgtggt cccactcgta tatggtggtg 180
agaccaaaat ggtggaaaca gccttaaccc cagatgcctg ctatcctgac taatttaagt 240
241



<210> 350 

<211> 241 

<212> DNA 

<213> Homo sapiens 



<400> 350 

aggtactgtg gatatttaaa atatcacagt aacaagatca tgcttgttcc tacagtattg 60
cgggccagac acttaagtga aagcagaagt gtttgggtga ctttcctact taaaattttg 120
gtcatatcat ttcaaaacat ttgcatcttg gttggctgca tatgctttcc tattgatccc 180
aaaccaaatc ttagaatcac ttcatttaaa atactgagcg gtattgaata cttcgaagca 240
g 241



<210> 351 

<211> 241 

<212> DNA 

<213> Homo sapiens 



<400> 351 

tacagaaatc atttggagcc gttttgagac agaagtagag gctctgtcaa gtcaatactg 60
cattgcagct tggtccactg aagaagccac gcctgagata caaaagatgc actacacttg 120
acccgcttta tgttcgcttc ctctcccctt ctctctcatc aactttatta ggttaaaaca 180
ccacatacag gctttctcca aatgactccc tatgtctggg gtttggttag aattttatgc 240
c 241



<210> 352 

<211> 241 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 10, 28, 29, 49, 54, 59, 72, 127, 148, 150, 160, 166, 182 
<223> n = A,T,C or G 



<400> 352 

gtaccctgtn gagctgcacc aagattannt ggggccatca tgactgcanc cacnacgang 60
acgcaggcgt gnagtgcatc gtctgacccg gaaacccttt cacttctctg ctcccgaggt 120
gtcctcnggc tcatatgtgg gaaggcanan gatctctgan gagttncctg gggacaactg 180
ancagcctct ggagaggggc cattaataaa gctcaacatc attggcaaaa aaaaaaaaaa 240
a 241



<210> 353 

<211> 241 

<212> DNA 

<213> Homo sapiens 



<400> 353 

aggtaccagt gcattaattt gggcaaggaa agtgtcataa tttgatactg tatctgtttt 60
ccttcaaagt atagagcttt tggggaagga aagtattgaa ctgggggttg gtctggccta 120
ctgggctgac attaactaca attatgggaa atgcaaaagt tgtttggata tggtagtgtg 180
tggttctctt ttggaatttt tttcaggtga tttaataata atttaaaact actataaaaa 240
c 241

<210> 354 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature
<222> 1 

<223> n = A,T,C or G 
<400> 354 

ngcaggtccg ggcaggtacc aagattcatt ctcatcaaaa actagaaaca gaagggcaaa 60 
ttccagtttc cttctgggat tgaatacttt caagtaaggt cttcgacaaa caatcagggg 120 
gccaattaat ccactgtaga ggtccttaac ttgatccaca gttgaataat aagcccatgg 180 
aatacaagca gaatcctctg ttccagctcc agatctttct gggattttcc atacgtaagt 240 
g 241 

<210> 355 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<400> 355 

ggtacccacc ctaaatttga actcttatca agaggctgat gaatctgacc atcaaatagg 60 
ataggatgga cctttttttg agttcattgt ataaacaaat tttctgattt ggacttaatt 120 
cccaaaggat taggtctact cctgctcatt cactctttca aagctctgtc cactctaact 180 
tttctccagt gtcatagata gggaattgct cactgcgtgc ctagtctttc ttcacttacc 240 
t 241 

<210> 356 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 27 

<223> n = A,T,C or G 
<400> 356 

aggtactgta attgagcatc cggaatntgg agaagtaatt tagctacagg gtgaccaacg 60 
caagaacata tgccagttcc tcgtagagat tggactggct aaggacgatc agctgaaggt 120 
tcatgggttt taagtgcttg tggctcactg aagcttaagt gaggatttcc ttgcaatgag 180 
tagaatttcc cttctctccc ttgtcacagg tttaaaaacc tcacagcttg tataatgtaa 240 



c 241

<210> 357
<211> 241
<212> DNA
<213> Homo sapiens



<400> 357 

ttttgtacca ccgatatgat caaggaaaat tctgcccatt tttatggctg aagttctaaa 60 
aacctaattc aaagttcttc catgatccta cactgcctcc aagatggtcc aggctggcat 120 
aaggcctgag cggcggtgag atccgcggct gccagcagct tgtcgctctt cagctggtat 180 
gaagcccctc ggccacccga gtctccagga cctgcccggg cgccgctcga aagggcgaat 240
^ 241

<210> 358 
<211> 241 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<222> 25, 57
<223> n = A,T,C or G

<400> 358

aggtacgggg agtgggggtg aagcntgttc tctacatagg caacacagcc gcctaantca 60
caaagtcagt ggtcggccgc ttcgaccaac atgtggtgag cattccacgg gcgcatgaag 120
tctgggtgct gtgctcgagt ctctgaatat tttgatagga agcgacaaga aaattcaaac 180
tgctctttgc tgactactgg aaagtgaaaa gatgctcaag tttaccattc aaagaaacca 240
t 241

<210> 359
<211> 241
<212> DNA
<213> Homo sapiens

<400> 359

gaggtacaca aaaggaatac cttctgagag ccagggagtg aggaaagggg aaggagactt 60
gacgtcaagg gtgcttttga ggaacatgac gggccagcca gcctgcccca actttgaggc 120
cctgctgggc tcttgtgact ataaatatac tgtctatttc taatgcaatc cgtctttcct 180
gaaagatctt gttatctttt actattgaga catgctttca tttttgtggt cctgtttcca 240
a 241

<210> 360
<211> 241
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 1
<223> n = A,T,C or G

<400> 360

ngtactctat actaattctg cctttttata cttaattcta aatttctccc ctctaattta 60
caacaaattt tgtgattttt ataagaatct atgcctcccc aattctcaga ttcttctctt 120
ttctccttta tttctttgct taaattcagt ataagctttc ttggtatttt aggcttcatg 180
cacattctta ttcctaaaca ccagcagttc ttcagagacc taaaatccag tataggaata 240
a 241



<210> 361 
<211> 241 
<212> DNA 

<213> Homo sapiens 



<400> 361 

aggtactctc cgtgccccga cactgaacat tatccagcca gatctgccca gtgccagctc 60
ccactttgta cttttcttac tatcctgtct agaatcatgt cttatgattt taacagatat 120
agaaccactc ctagaaaatg ttctttcact ttctcgtttc ctttttaatc tatcatcctg 180
actactgaac ttaaaatctt tttcttccct tttttgtttc tcttttcttt tatcctgttc 240
a 241

<210> 362 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 17, 23 

<223> n = A,T,C or G

<400> 362 

aggtactttt atacctngct tangtcagtg acagatttac caatgacaac acaattttaa 60 
aattccaaca catatattac tttgtcctat gaagggcaaa aagtcaatat attttaaatt 120 
ttaaaaacag aatggatata atgacctttt tacacatcag tgatatttaa aagacttaaa 180 
gagacaatac tatggttgag acactggctt cctattccag ccctaattaa agaaaaaata 240 
g 241 

<210> 363 
<211> 241 
<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 4 

<223> n = A,T,C or G 
<400> 363 

ttangtacta aaaacaaaat cctaattctg ttttaaagag ctgggagatg ttaatcatat 60 
gctcagtttt tccacgttat aatttcctaa atgcaaactt ttcaatcagg gcagttcaaa 120 
ttcattacat cacagtaaat aacagtagcc aactttgatt ttatgcttat aggaaaaaaa 180 
atcctgtaga tataaaaaca gcaaattttg acaaataaaa ctcaaaccat tcatccctaa 240 
a 241 

<210> 364 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<400> 364 

ggtacaagca gttagtcctg aaggcccctg ataagaatgt catcttctcc ccactgagca 60 
tctccaccgc cttggccttc ctgtctctgg gggcccataa taccaccctg acagagattc 120 
tcaaaggcct caagttcaac ctcacggaga cttctgaggc agaaattcac cagagcttcc 180 
agcacctcct gcgcaccctc aatcagtcca gcgatgagct gcagctgagt atgggaaatg 240 
c 241 

<210> 365 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<400> 365 

cgaggtactg agattacagg catgagccac cacgcccggc caaaaacatt taaaaaatga 60 
ctgtccctgc tcaaatactg cagtaggaaa tgtaatttga catatatcac ttccagaaaa 120 
aaactttaaa tctttctata aaatgaattt gatacatcat cagcatgaag tgaagttaaa 180 
atctcttaca aagtaaattc aggtatatca acaatgagat ccaaaagtat cggttcaaga 240
t 241

<210> 366 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<400> 366 

ggcaggtaca catcaaacac ttcattgcct aaatgcaggg acatgcttcc atctgaccac 60 
ttgactatcc gagcattgct ttctttaatt tcatttcctt cttcatctcg gcgtatcctc 120 
catcttatag tattttctac ctttaatttt aacctggttc taccttcttc atccagcatt 180 
tcttcatctt caaattcatc ttcataatac tgggctctac acttgagaaa gttgggcagt 240 
t 241 

<210> 367 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 25 

<223> n = A,T,C or G 
<400> 367 

gcaggtacaa ataattcctg ttgtnacatt tagtggacgc gattatctgt atacctcaaa 60 
ttttaattta agaaagtatc acttaaagag catctcattt tctatagatt gaggcttaat 120 
tactgaaaag tgactcaacc aaaaagcaca taacctttta aaggagctac acctaccgca 180 
gaaagtcaga tgccctgtaa ataactttgg tctttcaaaa tagtggcaat gcttaagata 240 
c 241 

<210> 368 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<400> 368 

tttgtacatt gttaatagtg accctcggag gaaatggatt tctcttctat taaaaactct 60 
atggtatata agcattacat aataatgcta cttaaccacc ttttgtctca agaattatca 120 
ccaaagtttt ctggaaataa gtccacataa gaattaaata tttaaaaggt gaaatgttcc 180 
ttattttaac tttagcaaga tcttttcttt ttcattaaga aacactttaa taattttaaa 240 
g 241 

<210> 369 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<400> 369 

gcaggtactt tattcttatt tcttatccta tattctgtgt tacagaaaaa ctactaccat 60 
aaacaaaaca ccaaccagcc acagcagttg tgtcaagcat gacaattggt ctagtcttca 120 
cattttatta gtaagtctat caagtaagag atgaagggtc tagaaaacta gacacaaagc 180 
aaccagggtc caaatcacca aggtagatct gtgcttagct aaagggaaac acccgaagat 240 
t 241 



<210> 370 
<211> 241 
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 1
<223> n = A,T,C or G

<400> 370

ngttcacagt gcccctccgg cctcgccatg aggctcttcc tgtcgctccc ggtcctggtg 60
gtggttctgt cgatcgtctt ggaaggccca gccccagccc aggggacccc agacgtctcc 120
agtgccttgg ataagctgaa ggagtttgga aacacactgg aggacaaggc tcgggaactc 180
atcagccgca tcaaacagag tgaactttct gccaagatgc gggagtggtt ttcagaagac 240
a 241



<210> 371 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 227 

<223> n = A,T,C or G 



<400> 371 

ggcaggtcat cttgagcctt gcacatgata ctcagattcc tcacccttgc ttaggagtaa 60
aacaatatac tttacagggt gataataatc tccatagtta tttgaagtgg cttgaaaaag 120
gcaagattga cttttatgac attggataaa atctacaaat cagccctcga gttattcaat 180
gataactgac aaactaaatt atttccctag aaaggaagat gaaaggnagt ggagtgtggt 240
t 241



<210> 372 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 26, 27, 59 
<223> n = A,T,C or G

<400> 372 

aggtacagca aagcgaccct tggtgnnata gatcagacgg aaattctctc ccgtcttgnc 60 
aatgctgatg acatccatga atccagcagg gtaggttata tcagttcgga ccttgccatc 120 
gattttaatg aaccgctgca tgcaaatctt ctttacttca tctcctgtca gggcatactt 180 
aagtctgttc ctcaggaaaa tgatgagggg gagacactct ctcaacttgt ggggaccggt 240 
g 241 

<210> 373 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<400> 373 

tactgaaaca gaaaaaatgt attcccacaa aagctgttac acagcggttt cccgtcccca 60 

gaagcagtag aaaatcttag cattccaatg gaaggcatgt atttgtaaaa tattctaaaa 120 

tcagctctat agtttccttg tcctctttga taagggatca gacagagggt gtgtccccct 180 

tcagcagcta cccttcttga caaactggtc tccaataata cctttcagaa acttacaaga 240
c 241

<210> 374 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<400> 374 

caggtactaa aacttacaat aaatatcaga gaagccgtta gtttttacag catcgtctgc 60 

ttaaaagcta agttgaccag gtgcataatt tcccatcagt ctgtccttgt agtaggcagg 120 

gcaatttctg ttttcatgat cggaatactc aaatatatcc aaacatcttt ttaaaacttt 180 

gatttatagc tcctagaaag ttatgttttt taatagtcac tctactctaa tcaggcctag 240 

c 241 

<210> 375 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<400> 375 

aggtacaaag gaccagtatc cctacctgaa gtctgtgtgt gagatggcag agaacggtgt 60 
gaagaccatc acctccgtgg ccatgaccag tgctctgccc atcatccaga agctagagcc 120 
gcaaattgca gttgccaata cctatgcctg taaggggcta gacaggattg aggagagact 180 
gcctattctg aatcagccat caactcagat tgttgccaat gccaaaggcg ctgtgactgg 240 
g 241 

<210> 376 

<211> 241 

<212> DNA 

<213> Homo sapiens 

<400> 376 

ggtacatttt actttccttc tttcagaatg ctaataaaaa acttttgttt atacttaaaa 60 

aaaccataaa tcagacaaac aaaagaaacg attccaacat cacttctgtg atgagaaaag 120 

aggcaatgga attcaacata agcaaagaaa actctacctg gaggaaagaa atcgatcagc 180 

gaagaaacaa ctcggggctg ctgccagact gcaggccatg cgaggaggag cctcctagag 240 

g 241 

<210> 377 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 234 

<223> n = A,T,C or G 
<400> 377 

tcctttctgt ccaggtgatt cacagactag acctttctta tcctcctcct agagttttga 60 
cttgggactc tagtgttaag atgatgagcc cgtgcatcag gtccttctgc actttggtgg 120 
aagtctccca gggtaggttt cctatttgaa acagtggaat catgtttcca gtgataaagt 180 
ttaatgacct catccttttt tttttttttc tcatctgcca tttgtgtgtc ttanatgggt 240 
t 241 



<210> 378
<211> 241
<212> DNA
<213> Homo sapiens



<400> 378 

aggtcagcga tcaggtcctt tatgggcagc tgctgggcag ccccacaagc ccagggccag 60
ggcactatct ccgctgcgac tccactcagc ccctcttggc gggcctcacc cccagcccca 120
agtcctatga gaacctctgg ttccaggcca gccccttggg gaccctggta accccagccc 180
caagccagga ggacgactgt gtctttgggc cactgctcaa cttccccctc ctgcagggga 240
t 241

<210> 379
<211> 241
<212> DNA
<213> Homo sapiens



<400> 379 

tacggagcaa tcgaagaggc atatccacac ttggggtggc tatagggctg gaaaatgctg 60
aagatgactg ctttcactga ggtcaaggat tgtaatattg ccagctttgt aaagccatta 120
aagcagaagt ttcttcagtg atcttctctc taagaaacac catcacctcc atgtgcctta 180
cagaggccGc ctgcgttctg ctgcattgct tttgcgcaat cccttgatga tgaagatggt 240
c 241

<210> 380
<211> 241
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature

<222> 24, 25, 26, 34, 36, 56, 113, 129, 137, 184, 185, 208, 210,
237, 240

<223> n = A,T,C or G



<400> 380 

acgtacacgc agaccgacat gggnnnttca ggcntnagat caaactcaaa acctgnaatg 60
atatccactc tctttttctt aagctcaggg aaatattcca agtagaagtc canaaagtca 120
tcggctaana tgcttcngaa tttgaattca tgcacatagg ccttgaaaaa actgtcaaac 180
tgannctgat cacccaccaa gtgggccntn tatgacacaa agcagaaacc tttctcntan 240
g 241



<210> 381 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<400> 381 

aggtacaact taatggatta gcttttgggt ttaactgaat atatgaagaa attgggtctg 60 
tctaaagaga gggtatttca tatggctttt agttcacttg tttgtatttc atcttgattt 120 
ttttctttgg aaaataaagc attctatttg gttcagattt ctcagatttg aaaaaggctc 180 
tatctcagat gtagtaaatt atttcctttc agtttgtgaa agcaggattt gactctgaaa 240 
g 241 



<210> 382 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<400> 382 

gtactgctat aatcaatacg tctgatagac aggtttatcc actatattga ccctacctct 60 
aaaaggattg tcataattta tatgctttat gtttacacct atgatacagt tgccttggaa 120
cacaaaattt ttcattgtaa ttaaaaaaag aagagttgtg cagacagaag aaatcaaatc 180
taagaaaatc acaggagtag ataaatactc tagaattcat atacccttgg aagatgggtt 240
t 241



<210> 383 
<211> 241 
<212> DNA 

<213> Homo sapiens 



<400> 383 

ggcaggtaca aagtcttctc tttgcttttt ataattttaa agcaaataac acatttaact 60
gtatttaagt ctgtgcaaat aatccttcag aagaaatatc caagattctg tttgcagagg 120
tcattttgtc tctcaaagat gattaaatga gtttgtcttc agataaagtg ctcctgtcca 180
gcagaactca aaaggccttc aagctgttca gtaagtgtag ttcagataag actccgtcat 240
a 241

<210> 384
<211> 241
<212> DNA
<213> Homo sapiens



<400> 384 

ggtacacaaa atacacttgc aagcttgctt acagagacct gttaaacaaa gaacagacag 60
attctataaa atcagttata tcaacatata aaggagtgtg attttcagtt tgttttttta 120
agtaaatatg accaaactga ctaaataaga aggcaaaaca aaaaattatg cttccttgac 180
aaggcctttg gagtaaacaa aatgctttaa ggctcctggt gaatggggtt gcaaggatga 240
a 241



<210> 385 

<211> 241 

<212> DNA

<213> Homo sapiens 



<400> 385 

ggcaggtcta caatggctct gtcccttctg tggaatcgtt acaccaagag gtctcagtcc 60
tggtccctga ccccacagtg agctgtttag atgatccttc acatcttcct gatcaactgg 120
aagacactcc aatcctcagt gaagactctc tggagccctt caactctctg gcaccaggta 180
ggtttggagg ctatgtccct ttaacttatc catgcagagt agccaaactt tacctgaaag 240
a 241



<210> 386 
<211> 241 
<212> DNA 

<213> Homo sapiens 



<400> 386 

aggtaccttt ttcctctcca aaggaacagt ttctaaagtt ttctgggggg aaaaaaaact 60
tacatcaaat ttaaaccata tgttaaactg catattagtt gtgttacacc aaaaaattgc 120
ctcagctgat ctacacaagt ttcaaagtca ttaatgcttg atataaattt actcaacatt 180
aaattatctt aaattattaa ttaaaaaaaa aactttctaa gggaaaaata aacaaatgta 240

241 



<210> 387 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<400> 387

accccactgg ccgctgtgga gtatctccac tctcccctcg tgagggccgc tcccaccgac 60
cagtcgaact ttcgtaaatg gagttaatgt gtttccactc cccttttccc ctttctggcc 120
ttttggtcca gaatttcctg gccttccggc atatcctggg agtcctcgac ttccaggaaa 180
gccaattgct ccccgatcac ctttaagacc cggaggacct attggacctg gaaatcctcg 240
t 241



<210> 388 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<400> 388 

tttgtactct tgtccacagc agagacattg agtataccat tggcatcaat gtcaaaagtg 60
acttcaatct gaggaacacc tcggggtgca ggaggtatgc ctgtgagttc aaacttgcca 120
agcaggttgt tatcctttgt catggcacgc tcgccttcat aaacctgaat aagtacacca 180
ggctggttgt cagaataggt agtgaaggtc tgtgtctgct tggtaggaat ggtggtatta 240
c 241



<210> 389 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 6, 28, 38, 43
<223> n = A,T,C or G 



<400> 389 

tacctntgtt agtgagcacc ttgtcttntg tgcttatntc ttnaagataa atacatggaa 60 
ggatgtgaaa atcggaacac caactatgtg tctcactgca tctaagtgaa gcagccacag 120 
ctgtgagagt tttcaaagca gaaagatgct gatgtgacct ctggaattca gacatactga 180 
gctatgggtc agaagtgttt tacttaaaaa gcaaacaatc cccaggaaat actgaatagg 240 
a 241 



<210> 390 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<400> 390 

gcaggtacat ccacatgttc ctccaaatga cgtttggggt cctgcttgcc aacattcttt 60
attgccagct gttcaggtgt catcttatct tcttcttcta cagccttatt gtaattcttg 120
gctaattcca acatctcttt taccactgat tcattgcgtt tacaatgttc actgtagtcc 180
tgaagtgtca aaccttccat ccaactcttc ttatgcaaat ttagcaacat cttctgttcc 240
a 241



<210> 391 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 2, 10, 14, 22, 23, 25, 40, 50, 57, 59, 65, 71, 72, 73, 76, 
77, 78, 82, 83, 84, 95, 98, 100, 101, 102, 107, 148, 152, 
155, 158, 163, 169, 170, 172, 180, 182, 192, 193, 198, 200, 
202, 203, 206, 207, 208, 213, 214, 218, 220, 224, 225
<223> n = A,T,C or G 



<221> misc_feature 
<222> 235, 236 
<223> n = A,T,C or G 



<400> 391 

cnggcacaan cttntgtttt tnntnttttt tttttttttn tctttatttn tttttantnt 60
taaanaaaaa nnntannnaa annngggttt aaatnctntn nncagancat taaaactgaa 120
ggggaaaaaa aaaccaaaaa cgagcttntt anttnacntg ggnttgggnn gntgctgatn 180
tnaagaagca anntttanan cnngcnnnat ganngagngn tcannttgaa atttnnaccc 240
t 241

<210> 392
<211> 241
<212> DNA
<213> Homo sapiens



<400> 392 

gaggtactaa atggtatcct tagattaaaa ttttgtgctt gataacagct gttttttcta 60
cattagaaat aagatgccac acaaggaact acattccaga tttaaagaaa tgaaaggata 120
ccattagtgt gtataacaga ttattgttca tacttgtaaa gcatcttatg tcattgagaa 180
tataaagaac agtgccttag aagacagtga aaggtaagct ctagcttaat gtctatgatt 240
t 241

<210> 393
<211> 241
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> 57, 75, 224
<223> n = A,T,C or G

<400> 393 

ggcaggtaca taagcataat cagttatgga cagcttcttg tataaattgc tattcancaa 60
tacataaact gcctnaaaga tttatgctta caggtagaca ttcaatttac caataaaaca 120
gcatgttctg aaaatatggg cacattttaa aacatattaa gacagttctg ttaaccataa 180
tagtcccaca gtatgactga gtaataagaa tctacttcaa aagnaaaaaa aaaattaatc 240
a 241

<210> 394
<211> 241
<212> DNA
<213> Homo sapiens



<400> 394 

aggtacagca gcagtagatg gctgcaacaa ccttcctcct accccagccc agaaaatatt 60
tctgccccac cccaggatcc gggaccaaaa taaagagcaa gcaggccccc ttcactgagg 120
tgctgggtag ggctcagtgc cacattactg tgctttgaga aagaggaagg ggatttgttt 180
ggcactttaa aaatagagga gtaagcagga ctggagaggc cagagaagat accaaaattg 240
g 241



<210> 395 
<211> 241 
<212> DNA 
<213> Homo sapiens

<220>

<221> misc_feature

<222> 1, 5, 8, 9, 14, 24, 26, 28, 32, 42, 54
<223> n = A,T,C or G

<400> 395 

nggcnggnnc caanatatga aatntnanta tnatacatga tnaaaagctt tatntatttt 60 

agtgagtaat taagtttaca ctgtgaataa ggattaattc ccagatgacc atctacagtt 120 

actaccacat agagggtata cacggatgga tcgattacaa gaatataaaa cttattttcc 180 

ttcctgtatc cacatttctt tgcaatgtga atttgcaggc cctctcaaga agtggagtct 240 
a 241 

<210> 396 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 26 

<223> n = A,T,C or G 
<400> 396 

gaggtacacc ttgaatgaca atgctnggag cccccctgtg gtcatcgacg cctccactgc 60 
cattgatgca ccatccaacc tgcgtttcct ggccaccaca cccaattcct tgctggtatc 120 
atggcagccg ccacgtgcca ggattaccgg ctacatcatc aagtatgaga agcctgggtc 180 
tcctcccaga gaagtggtcc ctcggccccg ccctggtgtc acagaggcta ctattactgg 240 
c 241 

<210> 397 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 90 

<223> n = A,T,C or G 
<400> 397 

ggcaggtacc agcaggggga tgtgtttctg gggaattgtg gctctggaag cttcacggtt 60 
tcccagaatg tggaaaatat atctgtgcan gatagaaatc ctgcccagag gctgtttctg 120 
tctcatttga gctctccttc atgtggcaga gctgactgtg gcggtttagg agcctacatt 180 
ttagaaaagc ttacctcaaa gttctgcatt gagcctgagc actggaaagg agataaaata 240 



<210> 398 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature

<222> 3, 11, 22, 27, 38, 41, 53, 59, 63, 69, 77, 78, 94, 131, 133, 
137, 149, 154, 162, 166, 167, 172, 175, 176, 179, 191, 230 
<223> n = A,T,C or G 

<400> 398

gangtgacca ngacatcacc tnacacntgg aaagcganga nttgaatggt gcntacaang 60
ccntacccnt tgcccannac ctgaacgcgc cttntgattg ggacagccgt gggaaggaca 120
gttatgaaac nantcanctg gatgaccana gtgntgaaac cnacanncac angcnntcna 180
cattatataa ncggaaagct aatgatgaga gcaatgatca ttccgatgtn attgatagtc 240
a 241



<210> 399 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 212, 226 
<223> n = A,T,C or G 

<400> 399 

cagagtgaga tgggagtggg agggccaatc tgatacagaa gggggtgaag ggtagggccc 60
ctgagcagcc caccccttac cctgacgaag gcaatcctcc tctggaatgt ctcttccctc 120
ttcagtctgg gttctgcctc agccacgaac tgggaaggag tgaggaacat cccaacggca 180
atgagagtat cccagtgact ccaaacagga angaatcagt gttcanaaag tcagggccct 240
t 241



<210> 400 
<211> 241 
<212> DNA 
<213> Homo sapiens 



<400> 400 

ggtactcttg ctcttttagc tagagtgtat gtgaaaataa agaaatacat cattgtattc 60
acaaccatgt gtcttcattt ataacttttt gtttaaaaaa tttttagttc aagtttagtt 120
cattgatatt atcctctgaa tgcagttaag gctgggcaga aattctactc atgtgacatc 180
tgccacaggt ctattttgaa gcttttcttc taatgggcaa tgtttgtcct taccaggatt 240
t 241



<210> 401 
<211> 241 
<212> DNA 
<213> Homo sapiens

<220>

<221> misc_feature
<222> 1, 2
<223> n = A,T,C or G

<400> 401

nncaggtact ttgtagagca gagagaggct ttggttcctc ctttcttcaa tcacgtggag 60
atgtgtcatc acctgggatt tcatctgggc cgccttttct gggtcaacag ccaacacatg 120
ctggtaatga cggatggtat gtaagcgatc tttgttctca gcacggacat aacgccgtaa 180
ggcctggaga atgcgatgag gccgtggcgg gtcagactgc aaggcagcca ggtagttctc 240
c 241



<210> 402 
<211> 241
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature

<222> 26, 27

<223> n = A,T,C or G

<400> 402 

ggcaggtcca aaaaaaacct aaaaanngtt tcaggaatgt agagaaatat ccaacttaaa 60
tagcgaaaaa gtgcaccata attactgctg cactgcagtc atttctgcaa ttcccatgtt 120
tcttaaataa ctatcttgtc agataacaca caatataaag agcaattatg aaaaacagac 180
atttacatat acttctaaag tcttattggg aatatcctgt ttggccattg ggataaccaa 240
t 241

<210> 403
<211> 241
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> 49

<223> n = A,T,C or G



<400> 403 

aggtgttaac tacccgctcc gagacgggat tgatgacgag tcctatgang ccattttcaa 60
gccggtcatg tccaaagtaa tggagatgtt ccagcctagt gcggtggtct tacagtgtgg 120
ctcagactcc ctatctgggg atcggttagg ttgcttcaat ctaactatca aaggacacgc 180
caagtgtgtg gaatttgtca agagctttaa cctgcctatg ctgatgctgg gaggcggtgg 240
t 241

<210> 404
<211> 241
<212> DNA

<213> Homo sapiens



<400> 404 

caggtactgc aacccataaa atactgtttc ctcatatttc accttcctta atttggagtt 60
ttctgtcttc ttttcacggc attcaaagta ggaataaact ttgcttgtgt tgggtggata 120
ttgtttatag tgagtaacct tgtaggagtc ggtggccagg aggatgttga actcggcttc 180
tgccgcagga ttcatctcgg gccggaggac aaggggcccg cgcgccgcga gctccctgac 240
c 241



<210> 405 
<211> 266 
<212> DNA

<213> Homo sapiens 
<400> 405 

ttctgggctg gggagtggag agaaagaagt tgcagggctt acaggaaatc ccagagcctg 60 
aggttttctc ccagatttga gaactctaga ttctgcatca ttatctttga gtctatattc 120 
tcttgggctg taagaagatg aggaatgtaa taggtctgcc ccaagccttt catgccttct 180 
gtaccaagct tgtttccttg tgcatccttc ccaggctctg gctgcccctt attggagaat 240 
gtgatttcca agacaatcaa tccaca 266 

<210> 406 
<211> 231 
<212> DNA 

<213> Homo sapiens 

<400> 406

ttggtgaaga accattcctc ggcatccttg cggttcttct ctgccatctt ctcatactgg 60 

tcacgcatct cgttcagaat gcggctcagg tccacgccag gtgcagcgtc catctccaca 120 

ttgacatctc cacccacctg gcctctcagg gcattcatct cctcctcgtg gttcttcttc 180 

aggtaggcca gctcctcctt caggctctca atctgcatct ccaggtcagc t 231 

<210> 407 
<211> 266 
<212> DNA 
<213> Homo sapiens 



<400> 407 

cagcatcatt gtttataatc agaaactctg gtccttctgt ctggtggcac ttagagtctt 60
ttgtgccata atgcagcagt atggagggag gattttatgg agaaatgggg atagtcttca 120
tgaccacaaa taaataaagg aaaactaagc tgcattgtgg gttttgaaaa ggttattata 180
cttcttaaca attctttttt tcagggactt ttctagctgt atgactgtta cttgaccttc 240
tttgaaaagc attcccaaaa tgctct 266

<210> 408
<211> 261
<212> DNA
<213> Homo sapiens



<400> 408 

ctgtgtcagc gagcctcggt acactgattt ccgatcaaaa gaatcatcat ctttaccttg 60
acttttcagg gaattactga actttcttct cagaagatag ggcacagcca ttgccttggc 120
ctcacttgaa gggtetgcat ttgggtcctc tggtctcttg ccaagtttcc cagccactcg 180
agggagtaat atctggaggg caaagaagag acttatgtta ttgttgaacc tccagccaca 240
gggaggagca tgggcatggg t 261

<210> 409
<211> 266
<212> DNA
<213> Homo sapiens



<400> 409 

gctgacagta atacactgcc acatcttcag cctgcaggct gctgatggtg agagtgaaat 60
ctgtcccaga cccgctgcca ctgaatcggt cagggatccc ggattcccgg gtagatgccc 120
agtaaatgag cagtttagga ggctgtcctg gtttctgctg gtaccaagct aagtagttct 180
tattgttgga gctgtctaaa acactctggc tggtcttgca gttgatggtg gccctctcgc 240
ccagagacac agccagggag tgtgga 266



<210> 410 
<211> 181 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 9, 17, 24, 26, 65, 97, 98, 99, 100, 103, 105, 106, 107, 108, 
120, 121, 123, 142, 145, 149, 162, 177 
<223> n = A,T,C or G 

<400> 410 

caaaaggtnc tttttgntca aaancnattt ttattccttg atatttttct tttttttttt 60 

tttgnggatg gggacttgtg aatttttcta aaggggnnnn ttnannnngg aagaaaaccn 120 

ngntccggtt ccagccaaac cngtngctna ctttccacct tntttccacc tccctcnggt 180 
t 181

<210> 411 
<211> 261 
<212> DNA
<213> Homo sapiens 

<400> 411 

gcccctgcag tacttggccg atgtggacac ctctgatgag gaaagcatcc gggctcacgt 60

gatggcctcc caccattcca agcggagagg ccgggcgtct tctgagagtc agggtctagg 120 

tgctggagtg cgcacggagg ccgatgtaga ggaggaggcc ctgaggagga agctggagga 180 

gctggccagc aacgtcagtg accaggagac ctcgtccgag gaggaggaag ccaaggacga 240 

aaaggcagag cccaacaggg a 261 

<210> 412 
<211> 171 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 1, 6, 53, 79, 91, 96, 114, 132 
<223> n = A,T,C or G 

<400> 412 

nttttntctt tacaattcag tcttcaacaa cttgagagct ttcttcatgt tgncaagcaa 60 

cagagctgta tctgcaggnt cgtaagcata nagacngttt gaatatcttc cagngatatc 120 

ggctctaact gncagagatg ggtcaacaaa cataatcctg gggacatact g 171 

<210> 413 
<211> 266 
<212> DNA 
<213> Homo sapiens 

<400> 413 

ttaggaccaa agatagcatc aactgtattt gaaggaactg tagtttgcgc attttatgac 60 

atttttataa agtactgtaa ttctttcatt gaggggctat gtgatggaga cagactaact 120 

cattttgtta tttgcattaa aattattttg ggtctctgtt caaatgagtt tggagaatgc 180 

ttgacttgtt ggtctgtgta aatgtgtata tatatatacc tgaatacagg aacatcggag 240 

acctattcac tcccacacac tctgct 266 

<210> 414 
<211> 266 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature

<222> 86, 153, 162, 178, 184, 205 

<223> n = A,T,C or G 

<400> 414 

tttgccataa ttgagtgaaa agtggcagat ggcattaact ctgctccgct tcaagctggc 60 

tccatgacca ctcaaggcct ccccancctg ttcgtcaagt tgtcctcaag tccaagcaat 120 

ggaatccatg tgtttgcaaa aaaagtgtgc tanttttaag gnctttcgta taagaatnaa 180 

tganacaatt ttcctaccaa aggangaaca aaaggataaa tataatacaa aatatatgta 240 

tatggttgtt tgacaaatta tataac 266 

<210> 415
<211> 266 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature
<222> 37, 103, 223 
<223> n = A,T,C or G 



<400> 415 

cctccatcca gtctattaat tgttgccggg aagctanagt aagtagttcg ccagttaata 60
gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtnacgctcg tcgattggta 120
tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 180
gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt canaagtaag ttggccgcag 240
tgttatcact catggttatg gcagca 266

<210> 416
<211> 878
<212> DNA
<213> Homo sapiens



<400> 416 

cctgacgata gccatggctg taccacttaa ctatgattct attccaactg ttcagaatca 60
tatcacaaaa tgacttgtac acagtagttt acaacgactc ccaagagagg aaaaaaaaaa 120
aaaaagacgc ctcaaaattc actcaacttt tgagacagca atggcaatag gcagcagaga 180
agctatgctg caactgaggg cacatatcat tgaagatgtc acaggagttt aagagacagg 240
ctggaaaaaa tctcatacta agcaaacagt agtatctcat accaagcaaa accaagtagt 300
atctgctcag cctgccgcta acagatctca caatcaccaa ctgtgcttta ggactgtcac 360
caaagtcaga ttcggtgcta accaggtggc atctatgatc aacgtcgccc ctcttattta 420
acaaagggct ctgaaggagg tgttctccaa gcaacaagga gactgcttca gtacaagact 480
ttgcaccttg aattcaattg catcaagtgt ggatagcaaa ataagtatct taccattgaa 540
atatgtgttc agcctaagat tttacccacc agcagaacaa aagtgagggt gagagggatg 600
ggccagtgag gggatggggg agaaaaaaaa atcacaggat taccaccaaa gccttgtttt 660
aaaagggctc ccttcactat tcaggaaggg aagtggaagg agaaattaac caattcctgc 720
cacagcagcc ctttttggct gcttccacaa tagatacttt atggagtggc acagccaacc 780
ctatctgtga cctgccctgc ggataaacac agccaagcag gtttaattag atcaaagaca 840
caaagggcta ttccctcctt tcataacaac gcagacct 878

<210> 417
<211> 514
<212> DNA
<213> Homo sapiens



<400> 417 

ttctgacttc tagaagacta aggctggtct gtgtttgctt gtttgcccac ctttggctga 60
tacccagaga acctgggcac ttgctgcctg atgcccaccc ctgccagtca ttcctccatt 120
cacccagcgg gaggtgggat gtgagacagc ccacattgga aaatccagaa aaccgggaac 180
agggatttgc ccttcacaat tctactcccc agatcctctc ccctggacac aggagaccca 240
cagggcagga ccctaagatc tggggaaagg aggtcctgag aaccttgagg tacccttaga 300
tccttttcta cccactttcc tatggaggat tccaagtcac cacttctctc accggcttct 360
accagggtcc aggactaagg cgttttctcc atagcctcaa cattttggga atcttccctt 420
aatcaccctt gctcctcctg ggtgcctgga agatggactg gcagagacct ctttgttgcg 480
ttttgtgctt tgatgccagg aatgccgcct agtt 514



<210> 418 
<211> 352 
<212> DNA 
<213> Homo sapiens



<400> 418 

ctgcaccagc gattaccagt ggcattcaaa tactgtgtga ctaaggattt tgtatgctcc 60
ccagtagaac cagaatcaga caggtatgag ctagtcaaca gcaagtcttt gttggattcg 120
agtaggctca ggatctgctg aaggtcggag gagttagtcc ccgcaatcaa gagcctgtct 180
tcctgaagcc cttggtgata ttttgccact cagccaagaa tgaggatgca tccttcagat 240
tctctatgtc ccgaacctgg aacccatcca cgccagcttg cagccaaaac tccagagcat 300
ccttcacctt ggtggaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 352

<210> 419
<211> 344
<212> DNA
<213> Homo sapiens



<400> 419 

ctggacacca taatcccttt taagtggctg gatggtcaca cctctcccat tgacaagctg 60
ggttaagtca ataggttgac taggatcaac acgacccaaa tcaataagat actgcagtct 120
attgagactc aaaggcttat actggcgtct gaaactatgt ccttcgttaa acccgtattt 180
tgggattcgg atgtaaaatg gagtctggcc tccctcaaag cccaagcggg gccgggttcc 240
tctttgcctt tctcctttat ggcctctgcc acattttcta cctcttctcc gacctcttgg 300
tcttctctcc ggtttcttgg agccgggatt cggctttaag ttgg 344



<210> 420 
<211> 935 
<212> DNA 
<213> Homo sapiens 



<400> 420 

cgaaagtcaa cgttaagggg ctcaggtgaa ccatgatgat gaccttctgt tgactttgaa 60
atattggctc ttgtgggtga caaaagccag acaagctgtg gctgtggtcc gattttaaga 120
cgaggttctc aaagatccaa aggagggaaa gggtattgga aacactgtgt atcatctgag 180
acacacgtgt cctcatgatc ttaaatgcct actttaaagc cacctaatac tgcccttcat 240
tgtggtcaga agagatttct acaaaagcac tcagaattct ggaggcagtt gtgattttgc 300
catgtggcag ttggtttgtg gagttgggca ggtgtgaaag ggtaaaactc cacttctgaa 360
tgctgcttct gccccctggg acccagcaca ttgttagacc atcttcttga ctgaaaattc 420
tctcctgatg ctgagccctg caccaccacc ttccttttcc taactatgaa ttgatggcaa 480
agtccactca aaacaaccag ttaagtgctc acgagagagt agtcaagcac ctccagaaag 540
aaaccgggtt tttgttcaca tagcaggaag tgactccctg ggtggtaatt tatcttggaa 600
acacaggtag attggcagaa aaacgggaac atgtaggtac cgcgatgttg gtgcatgtcc 660
attactttgg gataggcttt ctcagtcttt cctcaaatga tagttgagcc agttttccag 720
tggcaattct gagtgacttg cgcttgtctt atggtgtggt caagggacgt tcagaactac 780
ggaaaacttt tactgaaaca gcgaagcaga gtataccggc atgagaggga agatgaacac 840
tcacctatgt accactcttt gacaataaat atagtatttc tcaaaaaaaa aaaaaaaaaa 900
agtaaaaaaa ctgaaatcgc aagtcaaaaa atcca 935

<210> 421
<211> 745
<212> DNA
<213> Homo sapiens



<400> 421 

ggcttcgagc ggccgcccgg gcaggtccta gatgtcattt gggacccttc acaaccattt 60
tgaagccctg tttgagtccc tgggatatgt gagctgtttc tatgcataat ggatattcgg 120
ggttaacaac agtcccctgc ttggcttcta ttctgaatcc ttttctttca ccatggggtg 180
cctgaagggt ggctgatgca tatggtacaa tggcacccag tgtaaagcag ctacaattag 240
gagtggatgt gttctgtagc atcctattta aataagccta ttttatcctt tggcccgtca 300
actctgttat ctgctgcttg tactggtgcc tgtacttttc tgactctcat tgaccatatt 360
ccacgaccat ggttgtcatc cattacttga tcctacttta catgtctagt ctgtgtggtt 420
ggtggtgaat aggcttcttt ttacatggtg ctgccagccc agctaattaa tggtgcacgt 480
ggacttttag caagcgggct cactggaaga gactgaacct ggcatggaat tcctgaagat 540
gtttggggtt tttttctttc ttaatcgaaa gttaacattg tctgaaaagt tttgttagaa 600
ctactgcgga acctcaaaat cagtagattt ggaagtgatt caaagctaaa ctttttcctt 660
ggccctcctt gtgttctaat tgcttgcaag tgtaatacta ggatgtccaa gatgccagtt 720
tttgcttctt tgttagttgt cagac 745

<210> 422
<211> 764
<212> DNA
<213> Homo sapiens



<400> 422 

gagttcagta gcaaagtcac acctgtccaa ttccctgagc tttgctcact cagctaatgg 60
gatggcaaag gtggtggtgc tttcatcttc aggcagaagc ctctgcccat ccccctcaag 120
ggctgcaggc ccagttctca tgctgccctt gggtgggcat ctgttaacag aggagaacgt 180
ctgggtggcg gcagcagctt tgctctgagt gcctacaaag ctaatgcttg gtgctagaaa 240
catcatcatt attaaacttc agaaaagcag cagccatgtt cagtcaggct catgctgcct 300
cactgcttaa gtgcctgcag gagccgcctg ccaagctccc cttcctacac ctggcacact 360
ggggtctgca caaggctttg tcaaccaaag acagcttccc ccttttgatt gcctgtagac 420
tttggagcca agaaacactc tgtgtgactc tacacacact tcaggtggtt tgtgcttcaa 480
agtcattgat gcaacttgaa aggaaacagt ttaatggtgg aaatgaacta ccatttataa 540
cttctgtttt tttattgaga aaatgattca cgaattccaa atcagattgc caggaagaaa 600
taggacgtga cggtactggg ccctgtgatt ctcccagccc ttgcagtccg ctaggtgaga 660
ggaaaagctc tttacttccg cccctggcag ggacttctgg gttatgggag aaaccagaga 720
tgggaatgag gaaaatatga actacagcag aagcccctgg gcag 764

<210> 423

<211> 1041

<212> DNA

<213> Homo sapiens



<400> 423 

ctcagagagg ttgaaagatt tgcctacgaa agggacagtg atgaagctaa gctctagatc 60
caggatgtct gacttcaaat tgaaactccc aaagtaatga gtttggaagg gtggggtgtg 120
gcctttccag gatgggggtc ttttctgctc ccagcggata gtgaaacccc tgtctgcacc 180
tggttgggcg tgttgctttc ccaaaggttt tttttttagg tccgtcgctg tcttgtggat 240
taggcattat tatctttact ttgtctccaa ataacctgga gaatggagag agtagtgacc 300
agctcagggc cacagtgcga tgaggaccat cttctcacct ctctaaatgc aggaagaaac 360
gcagagtaac gtggaagtgg tccacaccta ccgccagcac attgtgaatg acatgaaccc 420
cggcaacctg cacctgttca tcaatgccta caacaggtat tgggatgtag ttcagccaca 480
tcattgctat ttatgaggtg tcttctgtag atccgaaatg tgggacagat gagagggaga 540
gtataaaatg agcggaagag gcaggctctg agtttgagca aatagattaa taggacaggt 600
gtccccagga aggacacctg gcctgtaagc tggttcctgg cattcagctc gccttgcagg 660
gatctgaaca aacactccag accactgggg gtgcagacgt gagagggacg cagtcgcaca 720
ctcagagggt tgagagtaaa tatgtgtgcc cgctgctgac cttcacgaaa ggccaaatgt 780
aagaagagct aagtgagaga gcagcaaagc actcctggag gocggggata atccaggcag 840
gcttctggga gtttgtcatt ccaaggataa ggaggacctg aacatggcct ttgcctaagg 900
cgtggccctc tcaaccagca ctaggtgctt atctggagct cagctagggg aggagacagc 960
tcagggccat tggtgtcagc cagagactct gtaatcttcc agggagctcg ctcaacctgc 1020
tgagctcgct ctgccacgca c 1041



<210> 424 

<211> 1288 

<212> DNA 

<213> Homo sapiens 

<400> 424

ctaagaactg agacttgtga cacaaggcca acgacctaag attagcccag ggttgtagct 60 

ggaagaccta caacccaagg atggaaggcc cctgtcacaa agcctaccta gatggataga 120 

ggacccaagc gaaaaaggta tctcaagact aacggccgga atctggaggc ccatgaccca 180 

gaacccagga aggatagaag cttgaagacc tggggaaatc ccaagatgag aaccctaaac 240 

cctacctctt ttctattgtt tacacttctt actcttagat atttccagtt ctcctgttta 300 

tctttaagcc tgattctttt gagatgtact ttttgatgtt gccggttacc tttagattga 360 

cagtattatg cctgggccag tcttgagcca gctttaaatc acagctttta cctatttgtt 420 

aggctatagt gttttgtaaa cttctgtttc tattcacatc ttctccactt gagagagaca 480 

ccaaaatcca gtcagtatct aatctggctt ttgttaactt ccctcaggag cagacattca 540 

tataggtgat actgtatttc agtcctttct tttgacccca gaagccctag actgagaaga 600 

taaaatggtc aggttgttgg ggaaaaaaaa gtgccaggct ctctagagaa aaatgtgaag 660 

agatgctcca ggccaatgag aagaattaga caagaaatac acagatgtgc cagacttctg 720 

agaagcacct gccagcaaca gcttccttct ttgagcttag tccatccctc atgaaaaatg 780 

actgaccact gctgggcagc aggagggatg atgaccaact aattcccaaa ccccagtctc 840 

attggtacca gccttgggga accacctaca cttgagccac aattggtttt gaagtgcatt 900 

tacaagtttc tggcatcact accactactg attaaacaag aataagagaa cattttatca 960 

tcatctgctt tattcacata aatgaagttg tgatgaataa atctgctttt atgcagacac 1020 

aaggaattaa gtggcttcgt cattgtcctt ctacctcaaa gataatttat tccaaaagct 1080 

aagataaatg gaagactctt gaacttgtga actgatgtga aatgcagaat ctcttttgag 1140 

tctttgctgt ttggaagatt gaaaaatatt gttcagcatg ggtgaccacc agaaagtaat 1200 

cttaagccat ctagatgtca caattgaaac aaactgggga gttggttgct attgtaaaat 1260 

aaaatatact gttttgaaaa aaaaaaac 1288 



<210> 425 
<211> 446 
<212> DNA 
<213> Homo sapiens 

<400> 425 

ccacttaaag ggtgcctctg ccaactggtg gaatcatcgc cacttccagc accacgccaa 60
gcctaacatc ttccacaagg atcccgatgt gaacatgctg cacgtgtttg ttctgggcga 120
atggcagccc atcgagtacg gcaagaagaa gctgaaatac ctgccctaca atcaccagca 180
cgaatacttc ttcctgattg ggccgccgct gctcatcccc atgtatttcc agtaccagat 240
catcatgacc atgatcgtcc ataagaactg ggtggacctg gcctgggccg tcagctacta 300
catccggttc ttcatcacct acatcccttt ctacggcatc ctgggagccc tccttttcct 360
caacttcatc aggttcctgg agagccactg gtttgtgtgg gtcacacaga tgaatcacat 420
cgtcatggag attgaccagg aggacc 446

<210> 426
<211> 874
<212> DNA
<213> Homo sapiens



<400> 426 

tttttttttt tttttttttt ttttttcaat taaagatttg atttattcaa gtatgtgaaa 60
acattctaca atggaaactt ttattaaatg ctgcatgtac tgtgctatgg accacgcaca 120
tacagccatg ctgtttcaga agacttgaaa tgccattgat agtttaaaaa ctctacaccc 180
gatggagaat cgaggaagac aatttaatgt ttcatctgaa tccagaggtg catcaaatta 240
aatgacagct ccacttggca aataatagct gttacttgat ggtatccaag aagaaatggt 300
tggtgatgga taaattcaga aatgcttccc caaaggtggg tggtttttaa aaagttttca 360
ggtcacaacc cttgcagaaa acactgatgc ccaacacact gattcgcggt ccaggaaaca 420
cgggtcttcc aagttccaag gggctggggt tccccaacga tcaagttcct gtgctgtaat 480
caagagggtc ctttggactg gatagggagc acttgggagc tgtacaccat cagtcataat 540
ggatggcagt gtaaaagatg atccaaatga cctgagatgc tcctgaggag tggtgcacca 600
gacccaggag tgccactgta gggctgcttc tttgctttag tcatcacaca cacacacagc 660
tccagagcag caatggcctt tcctgtaaca ggaaaaaagc ctcctgctat tcccaagaac 720
cctcgtaatg gcaaaactcc ccaaatgaca cccaggacca cagcaatgat ctgtcggaac 780
cagtagatca catctaaaaa ttcatcctta tcctcccagg ccgcgtcgct ccgcagcacc 840
ttactccaga cggagacttt gagggccccg ttgg 874

<210> 427 
<211> 638 
<212> DNA 
<213> Homo sapiens 



<400> 427 

acttgtaatt agcacttggt gaaagctgga aggaagataa ataacactaa actatgctat 60
ttgatttttc ttcttgaaag agtaaggttt acctgttaca ttttcaagtt aattcatgta 120
aaaaatgata gtgattttga tgtaatttat ctcttgtttg aatctgtcat tcaaaggcca 180
ataatttaag ttgctatcag ctgatattag tagctttgca accctgatag agtaaataaa 240
ttttatgggc gggtgccaaa tactgctgtg aatctatttg tatagtatcc atgaatgaat 300
ttatggaaat agatatttgt gcagctcaat ttatgcagag attaaatgac atcataatac 360
tggatgaaaa cttgcataga attctgatta aatagtgggt ctgtttcaca tgtgcagttt 420
gaagtattta aataaccact cctttcacag tttattttct tctcaagcgt tttcaagatc 480
tagcatgtgg attttaaaag atttgccctc attaacaaga ataacattta aaggagattg 540
tttcaaaata tttttgcaaa ttgagataag gacagaaaga ttgagaaaca ttgtatattt 600
tgcaaaaaca agatgtttgt agctgtttca gagagagt 638

<210> 428
<211> 535
<212> DNA
<213> Homo sapiens



<400> 428 

acaagatgat tcttcctcct caatttgaca gatcaaagaa gtatcccttg ctaattcaag 60
tgtatggtgg tccctgcagt cagagtgtaa ggtctgtatt tgctgttaat tggatatctt 120
atcttgcaag taaggaaggg atggtcattg ccttggtgga tggtcgagga acagctttcc 180
aaggtgacaa actcctctat gcagtgtatc gaaagctggg tgtttatgaa gttgaagacc 240
agattacagc tgtcagaaaa ttcatagaaa tgggtttcat tgatgaaaaa agaatagcca 300
tatggggctg gtcctatgga ggatacgttt catcactggc ccttgcatct ggaactggtc 360
ttttcaaatg tggtatagca gtggctccag tctccagctg ggaatattac gcgtctgtct 420
acacagagag attcatgggt ctcccaacaa aggatgataa tcttgagcac tataagaatt 480
caactgtgat ggcaagagca gaatatttca gaaatgtaga ctatcttctc atcca 535



<210> 429 
<211> 675 
<212> DNA 
<213> Homo sapiens 



<400> 429 

actattttca accctgagca ttaacactgc ataccaaggg ggggtgggtc aagaagctgg 60
ttagatcgaa gcacaagcac aagccactga tattctctat gtgatcaggt ttttacaaaa 120
aaatacatag ttttcaataa ataatgctta attttacaac tttgatacag caatgtcata 180
caccgtttca acacactaca ctctgcatgc tagatagtct acgagaagac gaaactttgc 240
catgcatttt ctttcccccc tagtgctatc aaacacttca tcctccagcg cactgcctca 300
ggtagcttta ccttctctct gtttcacagc aataggccgt gcgctggcat gcaaactcta 360
aaaaaggtcc cccccacaaa ccactcagac ttctacacaa aagggttttt cagcttttct 420
gctcccaaac ctggagtggc taagaaagta agtttcatgt ggccttggaa aatacacact 480
tgttaacagt gtcatgctga aaactgctct aaaacatcag gtggttctgt cctggtggcc 540
gtcacgaagc attatgggat gccataacca ctaggagtcc caaaccggaa aaaataggcc 600
tccgttttaa aacagtcaat tcaaaaaagg tgtcacagaa caaatgcaaa agactcttaa 660
acccacaaca tatgt 675



<210> 430
<211> 434
<212> DNA

<213> Homo sapiens 

<400> 430 

acctctgcca gaagtccagc gagaggacct cacagtagag cacaggccac tccgggagtg 60
catcagaaga ttcatcctca tggaggaaga aggcttcaaa cgtgaatggg taggagaagt 120
gagccacctt gtccattgcc agggacttgg tggtgcaggt ctgtgttact cctgagagct 180
gctggaatgc tgggcttgac cagtgagcag ttggcaattc tacaaagaag tggacgtaga 240
gattgtcata ctcatagcct tgggctgaaa cgacctctcc atttacaaag agccggaggg 300
cacctgggac agtcatctca aagtcggtgc ctacgaggct gctgagatac tccttgtgcc 360
ggccataaag atccttgaac actcgccgtt cccgctcctc ctcctccggc tgtgcgtggg 420
gggaaacatt gtcg 434

<210> 431
<211> 581
<212> DNA
<213> Homo sapiens



<400> 431 

acacaagcct ccagcccgac ccagcggcct aatgaaactc tggcaaccta tcctgggcgt 60
ggccacgagt atccagctcc aagcccaagt gaggcgggga gtcaacttcc ccatgattgc 120
caagtgacca agaccagaag cagggacgat taggctagtt ctgcggcaag gtgaactgga 180
gaccctgtct ctgccctcct tccctggcct gtcccacaga catcccgttg tttaacccac 240
tgcctttgca aggacctgct ctgtccactc caaatcaaag gatacttgca tccttcttac 300
acagactccc atctctctgc tcatagtggt cccaggctgc ccgagaaaaa gaaacttggg 360
tcagtagaag gctcattagt gtgaaggagt gagaggccag gccttcctgt gacataatgc 420
ttctatgctt gtttcctaaa cacttggtcc acacacaata cctgggcagg aagagagaac 480
caagcaccac tggatggctc tggagccagg ggacttctat gcacatacaa ccaacatcac 540
cccactctgc tcatctgtgc ctccaccctg aacagcagag t 581

<210> 432
<211> 532
<212> DNA

<213> Homo sapiens



<400> 432 

actccaactc aagtttacaa gttacacctt 
tgcagaattc agctgtggta gagtgctgat 
cttgacagtc atgtgctttg taagtccttg 
ctgcataact ggaagaagag attcttcagt 
ccattcacca ttatccagaa ttttcagtgc 
aggaggaaag tgcaccatgt catagtccaa 
agtatgttgc tcgacatcaa cctctccaat 
tagaggccga cccagaccaa agtttaaagc 
ttggtgctta gtataagtgt tgtcagtcac 

<210> 433 
<211> 531 
<212> DNA 
<213> Homo sapiens 



tgccacagcc ttggctaaat cttgaactag 60 

cttagcatgc ttcgatgtgg catacttgtt 120 

atttaccatg actacattct tagccaggtg 180 

atatgacagg taatgttgta gagttggtgt 240 

taagcaaaaa gctcctgctg caatttgaga 300 

catagttagt tccatcaggt atttggccaa 360 

cttagatgct ctccgaagga agtgcaaagg 420 

tcttagaatc ttcatttcca tctgtctgat 480 

aaaagcaaag tcaccaattt ct 532 



<400> 433 

acttggtttt acagctcctt tgaaaactct 
acactacagt ggtttagaaa ttactaattt 
tgaaatgact tcttaaatat ttagttgata 
ctcttttata tgctaaagga gcatctatca 
atattgagat gacactaggt gcaatagcag 
ccttgagatc tgtggtggtc ttcaaaatgg 



gtgtttggaa tatctctaaa aacatagaaa 60 
tacttctaag tcattcataa accttgtcta 120 
gactgctaca ggtaataggg acttagcaag 180 
gattaagtta gaacatttgc tgtcagccac 240 
ggatagattt tgttggtgag tagtctcatg 300 
tggccagcca gatcaaggat gtagtatctc 360 
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atagttccca ggtgatattt ttcttattag aaaaatatta taactcattt gttgtttgac 420 
acttatagat tgaaatttcc taatttattc taaattttaa gtggttcttt ggttccagtg 480 
ctttatgttg ttgttgtttt tggatggtgt tacatattat atgttctaga a 531 

<210> 434 
<211> 530 
<212> DNA 
<213> Homo sapiens 



<400> 434

acaagagaaa acccctaaaa aaaggatggc tttagatgac aagctctacc agagagactt 60
agaagttgca ctagctttat cagtgaagga acttccaaca gtcaccacta atgtgcagaa 120
ctctcaagat aaaagcattg aaaaacatgg cagtagtaaa atagaaacaa tgaataagtc 180
tcctcatatc tctaattgca gtgtagccag tgattattta gatttggata agattactgt 240
ggaagatgat gttggtggtg ttcaagggaa aagaaaagca gcatctaaag ctgcagcaca 300
gcagaggaag attcttctgg aaggcagtga tggtgatagt gctaatgaca ctgaaccaga 360
ctttgcacct ggtgaagatt ctgaggatga ttctgatttt tgtgagagtg aggataatga 420
cgaagacttc tctatgagaa aaagtaaagt taaagaaatt aaaaagaaag aagtgaaggt 480
aaaatcccca gtagaaaaga aagagaagaa atctaaatcc aaatgtaatg 530



<210> 435 
<211> 677 
<212> DNA 
<213> Homo sapiens 



<400> 435

accttatgat ctaattaata gatattagaa acagtagaaa gacaagttac acgtcaatgc 60
ccaatgacta gagtcaacat taaagagttg taatttaagt aatccaaact gacatctaat 120
tccaaaatca tttataaaat gtatttggct ttggaatcca caggacttca aacaagcaaa 180
gtttcactgc agatagtcac aaagatgcag atacactgaa atacttaaga gccttattaa 240
tgatttttgt tattttggat cttctgtttt tttcttatta tggtccgaag cctccttaat 300
accaatttat cagacagaag catgtcatct tgttgttcaa gataatccag taaattttca 360
gtccattcaa gtgccgcttt atggctaata cgcttctctg gattcagttc tgtttttcta 420
ctcttactgg aaggcttttg ctcagcagcc ttggtctggt cctcagcact ttcactgtca 480
gtcagcacct gacagcttga gtcactgctc cgagagtcga accactgatc aatattctca 540
atgtcaacat gttcacattc ttctgtgttc tgtaaaactg ttgctaaatt agctgctaaa 600
atggctcctt catcaatgtt catacctgaa ttctcttcat tgccagggaa aagttttttc 660
catgctttgg ttatggt 677



<210> 436 
<211> 573 
<212> DNA 
<213> Homo sapiens 



<400> 436

acctcttagg gtgggagaaa tggtgaagag ttgttcctac aacttgctaa cctagtggac 60
agggtagtag attagcatca tccggataga tgtgaagagg acggctgttt ggataataat 120
taaggataaa atttggccag ttgacagatt ctgtttccag cagtttttac agcaacagtg 180
gagtgcttca gtattgtgtt cctgtaaatt taattttgat ccgcaatcat ttggtataca 240
atgctgtttg aagttttgtc ctattggaaa agtcttgtgt tgcaggggtg cagttaagat 300
ctttgtgatg aggaatggga tgggctaatt ttttgccgtt ttcttggaat cgggggcatg 360
gcaaatacag tagggtagtt tagttcttta cacagaacat gataaactac acctgttgat 420
gtcaccgtct gtcaatgaat attatagaag gtatgaaggt gtaattacca taataacaaa 480
acaccctgtc tttagggctg acctttcgtc ctttgacctc ctcagcctcc attcccatct 540
tcgctcagac tgcaagtatg tttgtattaa tgt 573



<210> 437
<211> 645
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 605
<223> n = A,T,C or G

<400> 437

acaattggta tccatatctt gttgaaattg taatgggaaa acaatatatt tcaatctcta 60
tgtagatagt gggtttttgt tttcataata tattctttta gtttactgta tgagttttgc 120
aggactgcat aatagatcac cacaatcata acatcttagg accacagaca tttatgagat 180
catggcttct gtgggttaga agtatgctca tgtcttaact gggtcctctg ctcagtctta 240
tctggctgca atcaaggtgt cagctgggct gaattttcat ttggaatctt gactgggaaa 300
gagtctgctt ccaaggtcat gaagtttgct ggcaaaatgt atgtttttat gacagtatga 360
ctgaaatccc aagctatctc ctgactttta gctgggtaat ctcaggccct aaatgttgcc 420
tacagttcct agaggctggt cacagttctt agccatgtgg atttcctcaa catggctgct 480
tgcttcatca agtcagcaag aatagcctgt catatcagtg tatatcaggc tcactcagga 540
taatttccct actgatgagc caaacactaa ctgattttag agcttaacta catctgcaaa 600
attcngttca ccagaggcaa gtcatattca gggaaggaga agtgt 645



<210> 438
<211> 485
<212> DNA
<213> Homo sapiens

<400> 438

acagaattga gagacaagat tgcttgtaat ggagatgctt ctagctctca gataatacat 60
atttctgatg aaaatgaagg aaaagaaatg tgtgttctgc gaatgactcg agctagacgt 120
tcccaggtag aacagcagca gctcatcact gttgaaaagg ctttggcaat tctttctcag 180
cctacaccct cacttgttgt ggatcatgag cgattaaaaa atcttttgaa gactgttgtt 240
aaaaaaagtc aaaactacaa catatttcag ttggaaaatt tgtatgcagt aatcagccaa 300
tgtatttatc ggcatcgcaa ggaccatgat aaaacatcac ttattcagaa aatggagcaa 360
gaggtagaaa acttcagttg ttccagatga tgatgtcatg gtatcgagta ttctttatat 420
tcagttccta tttaagtcat ttttgtcatg tccgcctaat tgatgtagta tgaaaccctg 480
catct 485

<210> 439
<211> 533
<212> DNA
<213> Homo sapiens

<400> 439

acagcagttt cctcatccct gcagctgtgt ttgaacaggt catttaccat actgtcctcc 60
aggttcaaca gtatggctcc aaatgatgaa atttcattct gattttctgg ctgaagacta 120
ttctgtttgt gtatgtccac cacagttact ttatcccttc atctgtggat gggcagaatg 180
aaacatatat ggaaatgttc tgtgcaataa aaacagcagt ggtaacacag atgtaggctc 240
tgagtgtctc actggagact gaagtccaca gatatgcaac aaagcctttg tctccctgat 300
gtttttgcct cctgctggtc atgtgctttc acacatcaag agaggacatt taacatttga 360
gccacagtgt catttgctgt tgtctgatgg ttggttggca gagaatttga actggagatg 420
aactttatta tccaggacgc tgagagtata acatgcatga cagagctttt agagcactgt 480
gatgtaacat gtcaagcaga aatagggagc atgtttacag ccattctatg aaa 533

<210> 440
<211> 341
<212> DNA
<213> Homo sapiens

<400> 440

catggggtag gggggtcggg gattcattga attgtggttg gcaggagcaa gccctgctca 60
cactctcaca ctcgcaccca gaattgtcaa agatacagat tgtaaaaatc tacgatccct 120
cagtctcact cacaaaaaat aaaatctcat gtccccaacg aacccagagt cagacgacag 180
ctggagcatt ggcagggaca gtcagaaagg agacaagtga aaacggtcag atggacacag 240
gcggaggaga aaagacagag ggagagagac catcgggaac aatcagaggg gccgagacga 300
tcagaaaagg gtcagcccga gacaggctga gccagagttt 341



<210> 441 

<211> 572 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 53, 84, 132, 138, 148 

<223> n = A,T,C or G 



<400> 441

aagtttgggg ataatttatt atgcagcaag agataataca caggacttct canagcactt 60
aatatgttaa tataaatctc caanaaaaaa gatatacaat gaaacattcc tcttagttat 120
ctggccaagg anactttntt tttttganaa tattcttcaa aaagctgatc taatgatatg 180
gctctggtcc tacaattcca tgtaacttct aaccttgatt ttatctcatg agcaaatcat 240
ttatccttcc agaacctcaa cttttccctt ttacaaagta gaaataaacc atctgccttt 300
acataaatca ttaatacagc cctggatggg cagattctga gctatttttg gctggggggt 360
gggaaatagc ctgtggaggt cctaaaaaga tctacggggc tcgagatggt tctctgcaag 420
gtagcaggtg ggctcagggc ccatttcagt ctttgttccc caggccattt ccacaaaatg 480
gtgagaaata gtgtcttctt ttagcttgct cataactcaa agatgggggg catggacctg 540
ggcctttcta ggctagggca tgaacctcct 572



<210> 442 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 34, 67 

<223> n = A,T,C or G 



<400> 442

tcccagctgc actgcttaca cgtcttcctt cgtnttcacc taccccgagg ctgactcctt 60
ccccagntgt gcagctgccc accgcaaggg cagcagcagc aatgagcctt cctctgactc 120
gctcagctca cccacgctgc tggccctgtg agggggcagg gaaggggagg cagccggcac 180
ccacaagtgc cactgcccga gctggtgcat tacagagagg agaaacacat cttccctaga 240
gggttcctgt agacctaggg aggaccttat ctgtgcgtga aacacaccag gctgtgggcc 300
tcaaggactt gaaagcatcc atgtgtggac tcaagtcctt acctcttccg gagatgtagc 360
aaaacgcatg gagtgtgta 379

<210> 443
<211> 511
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 444
<223> n = A,T,C or G

<400> 443

acatgccccc aaaggctcgc ttcattgcta cgattctcta cttaaatcca cattcacagc 60 

tattgcctca gaccctctgg aggaggggcc aggggttagc tggctttgaa tagcatgtag 120 

agcacaggca gtgtggccac aaatgtcaca caggtgacca gggtgctata gatggtgttc 180 

ctgttgactt gggcttctag tctctgctcc gtgtctgaca gtgccaagat catgctcccc 240 

tgctccagca agaagctggg catagccccg tctgctggtt ccaccaggcc tgggtgtgct 300 

gcagacttta caagctgaac caccccagcc atttggctac aagtcttttc taggccatca 360 

agctgctctc gtaagccttc tagacatgaa tggacttgcc tggaatgact aagctgctct 420 

ttcaaggcag ctgaaaggac atcnacatct ctgtctctgg tcgggggact acctgcctgt 480

gacccagagt cctgccctgg cccagcagca t 511 

<210> 444 
<211> 612 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> 547

<223> n = A,T,C or G
<400> 444 

acaggaagaa ttctacagtt aatctatcac agtgttccag caaagcatat gttgaaaact 60 

acagttttca atctaacatc taaattttaa aaagtagcat ttcagcaaca aacaagctca 120 

gagaggctca tggcaaaagt gaaataacag aactattgct cagatgtctg caaagtcaag 180 

ctgctgccct cagctccgcc cacttgaagg cttaggcaga cacgtaaggt ggcggtggct 240 

ccttggcagc accattcaca gtggcatcat catacggagg tagcagcacc gtagtgtcat 300 

tgctggtaac ataaaccagg acatcagagg agttcctacc attgatgtat cggtagcagt 360 

tccaaacaca gctaatcaag taacccttaa aagtcaagat aatgctaata aacagaagaa 420 

taataaggac caaacaggta ggattcactg acatgacatc atctctgtag ggaaaattag 480 

gaggcagttg ccgtatgtat tcctgaatgg agtttggata aataagcaca gtgattgcaa 540 

ccaacanctt cagggcaaag tcaaagatct ggtaacagaa gaatgggatg atccaggctg 600 

cgcgttgctt gt 612 

<210> 445 
<211> 708 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 643, 676 
<223> n = A,T,C or G 

<400> 445 

accatcctgt tccaacagag ccattgccta ttcctaaatt gaatctgact gggtgtgccc 60 

ctcctcggaa cacaacagta gaccttaata gtggaaacat cgatgtgcct cccaacatga 120 

caagctgggc cagctttcat aatggtgtgg ctgctggcct gaagatagct cctgcctccc 180 

agatcgactc agcttggatt gtttacaata agcccaagca tgctgagttg gccaatgagt 240 

atgctggctt tctcatggct ctgggtttga atgggcacct taccaagctg gcgactctca 300 

atatccatga ctacttgacc aagggccatg aaatgacaag cattggactg ctacttggtg 360 

tttctgctgc aaaactaggc accatggata tgtctattac tcggcttgtt agcattcgca 420 

ttcctgctct cttaccccca acgtccacag agttggatgt tcctcacaat gtccaagtgg 480 

ctgcagtggt tggcattggc cttgtatatc aagggacagc tcacagacat actgcagaag 540 

tcctgttggc tgagatagga cggcctcctg gtcctgaaat ggaatactgc actgacagag 600 

agtcatactc cttagctgct ggcttggccc tgggcatggt ctncttgggg catggcagca 660 

atttgatagg tatgtntgat ctcaatgtgc ctgagcagct ctatcagt 708 
<210> 446
<211> 612
<212> DNA
<213> Homo sapiens

<400> 446

acaagcaacg cgcagcctgg atcatcccat tcttctgtta ccagatcttt gactttgccc 60
tgaacatgtt ggttgcaatc actgtgctta tttatccaaa ctccattcag gaatacatac 120
ggcaactgcc tcctaatttt ccctacagag atgatgtcat gtcagtgaat cctacctgtt 180
tggtccttat tattcttctg tttattagca ttatcttgac ttttaagggt tacttgatta 240
gctgtgtttg gaactgctac cgatacatca atggtaggaa ctcctctgat gtcctggttt 300
atgttaccag caatgacact acggtgctgc tacccccgta tgatgatgcc actgtgaatg 360
gtgctgccaa ggagccaccg ccaccttacg tgtctgccta agccttcaag tgggcggagc 420
tgagggcagc agcttgactt tgcagacatc tgagcaatag ttctgttatt tcacttttgc 480
catgagcctc tctgagcttg tttgttgctg aaatgctact ttttaaaatt tagatgttag 540
attgaaaact gtagttttca acatatgctt tgctggaaca ctgtgataga ttaactgtag 600
aattcttcct gt 612

<210> 447
<211> 642
<212> DNA
<213> Homo sapiens

<400> 447

actgaaagaa ttaaagtcag aagtcttccc aaaacaaaaa gaactgccca cagagaaaat 60
cctttctgat acttttcatt gctaaaataa aacaggcggg aaatgtggaa aagaaattca 120
acaaaataat gtagcaccag aagaacaagt cctagatgat tcaagttcaa aaggtaagct 180
ccagcaatgt ggaagaggta aagaccaatg tagacaagct gacgaggaat atcttctttt 240
ttggttttct ggaagtagag ttcaggaaaa gcatgaagcc agtaagccag ctgtgatatg 300
tagaaaaact tcatttgaaa tgtcatcagg ttatggggat aagccctcca taagatagtt 360
gggtctgaga tgtagttttc agagatgaga atgaatgtgc cccaaacaca ggcaaaaagg 420
tagaacgcac taagctgacc agattcatta aacttgctgt gttttgtttt ggagaagtgc 480
attcgcctgt taattttatc caacatatac tcttgaatta cggcatgaat aattatcgcc 540
actagcatgt agaagaaaac agtagccaaa tctttgatgc catagtaata aagggacact 600
gattcagtag cttgttcttc tgttgctggg agggtgacat tg 642

<210> 448
<211> 394
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 66
<223> n = A,T,C or G

<400> 448

accagaagac cttagaaaaa ggaggaaagg aggagaggca gataatttgg atgaattcct 60
caaagngttt gaaaatccag aggttcctag agaggaccag caacagcagc atcagcagcg 120
tgatgttatc gatgagccca ttattgaaga gccaagccgc ctccaggagt cagtgatgga 180
ggccagcaga acaaacatag atgagtcagc tatgcctcca ccaccacctc agggagttaa 240
gcgaaaagct ggacaaattg acccagagcc tgtgatgcct cctcagcagg tagagcagat 300
ggaaatacca cctgtagagc ttcccccaga agaacctcca aatatctgtc agctaatacc 360
agagttagaa cttctgccag aaaaagagaa ggag 394

<210> 449
<211> 494
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 66
<223> n = A,T,C or G

<400> 449

acaaaaaaca caaggaatac aacccaatag aaaatagtcc tgggaatgtg gtcagaagca 60
aaggcntgag tgtctttctc aaccgtgcaa aagccgtgtt cttcccggga aaccaggaaa 120
aggatccgct actcaaaaac caagaattta aaggagtttc ttaaatttcg accttgtttc 180
tgaagctcac ttttcagtgc cattgatgtg agatgtgctg gagtggctat taaccttttt 240
ttcctaaaga ttattgttaa atagatattg tggtttgggg aagttgaatt ttttataggt 300
taaatgtcat tttagagatg gggagaggga ttatactgca ggcagcttca gccatgttgt 360
gaaactgata aaagcaactt agcaaggctt cttttcatta ttttttatgt ttcacttata 420
aagtcttagg taactagtag gatagaaaca ctgtgtcccg agagtaagga gagaagctac 480
tattgattag agcc 494

<210> 450
<211> 547
<212> DNA
<213> Homo sapiens

<400> 450

actttgggct ccagacttca ctgtccttag gcattgaaac catcacctgg tttgcattct 60
tcatgactga ggttaactta aaacaaaaat ggtaggaaag ctttcctatg cttcgggtaa 120
gagacaaatt tgcttttgta gaattggtgg ctgagaaagg cagacagggc ctgattaaag 180
aagacatttg tcaccactag ccaccaagtt aagttgtgga acccaaaggt gacggccatg 240
gaaacgtaga tcatcagctc tgctaagtag ttaggggaag aaacatattc aaaccagtct 300
ccaaatggga tcctgtggtt acagtgaatg gccactcctg ctttattttt cctgagattg 360
ccgagaataa catggcactt atactgatgg gcagatgacc agatgaacat catcatccca 420
agaatatgga accaccgtgc ttgcatcaat agatttttcc ctgttatgta ggcattcctg 480
ccatccattg gcacttggct cagcacagtt aggccaacaa ggacataata gacaagtcca 540
aaacagt 547

<210> 451
<211> 384
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 8, 9, 19, 41
<223> n = A,T,C or G

<400> 451

actacttnnt ggttaaaang ccactggtag agtcatctga ntgtaaacaa tgtccctgca 60
ctgctggaaa aatccactgg ctcccaagaa aagaaaatgg tctgaagcct ctgttgtggc 120
tctcacaact catctttccc taagtcatca agctccacat cactgaggtc aatgtcatcc 180
tccacgggaa gctcgccatc cctgccgtcc caaggctctc tctcaacgat ggtagggaaa 240
gccccgcctc ctacaggtgc cgtggagcca cgcccaaaag agagctccct gagaaactcg 300
ttgatgcctt gctcactgaa ggagcctttt agcagagcaa atttcatctt gcgtgcattg 360
atggcggcca tggcggggta ccca 384



<210> 452
<211> 381
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 291, 341, 368
<223> n = A,T,C or G

<400> 452

actctaaagt tgccactctc acaggggtca gtgataccca ctgaacctgg caggaacagt 60
cctgcagcca gaatctgcaa gcagcgcctg tatgcaacgt ttagggccaa aggctgtctg 120
gtggggttgt tcatcacagc ataatggcct agtaggtcaa ggatccaggg tgtgaggggc 180
tcaaagccag gaaaacgaat cctcaagtcc ttcagtagtc tgatgagaac tttaactgtg 240
gactgagaag cattttcctc gaaccagcgg gcatgtcgga tggctgctaa ngcactctgc 300
aatactttga tatccaaatg gagttctgga tccagttttc naagattggg tggcactgtt 360
gtaatganaa tcttcactgt a 381

<210> 453
<211> 455
<212> DNA
<213> Homo sapiens

<400> 453

actgtgctaa acagcctata gccaagtttt aaagagttac aggaacaact gctacacatt 60
caaagaacag gcattcactg cagcctcctg atttgacctg atgggaggga caggagaatg 120
agtcactctg ccaccacttt tcctgccttg gatttgtaga ggatttgttt tgctctaatt 180
tgtttttcct atatctgccc tactaaggta cacagtctgg gcactttgaa aatgttaaag 240
tttttaacgt ttgactgaca gaagcagcac ttaaaggctt catgaatcta ttttccaaaa 300
aaagtatgct ttcagtaaaa cattttacca ttttatctaa ctatgcactg acatttttgt 360
tcttcctgaa aaggggattt atgctaacac tgtattttta atgtaaaaat atacgtgtag 420
agatatttta acttcctgag tgacttatac ctcaa 455

<210> 454
<211> 383
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 9
<223> n = A,T,C or G

<400> 454

acagagcanc tttacaagtt gtcacatttc tttataaatt tttttaaagc tacagtttaa 60
tacaaaatga attgcggttt tattacatta ataacctttc acctcagggt tttatgaaga 120
ggaaagggtt ttatgcaaaa gaaagtgcta caattcctaa tcattttaga cactttagga 180
gggggtgaag ttgtatgata aagcagatat tttaattatt tgttatcttt ttgtattgca 240
agaaatttct tgctagtgaa tcaagaaaac atccagattg acagtctaaa atggctactg 300
gtattttagt taattcaaaa atgaaacttt tcagtgattc actttactaa cattctattt 360
gagaaggctt attggtaaag ttt 383

<210> 455
<211> 383
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 10
<223> n = A,T,C or G

<400> 455

actcctttan gacaaggaaa caggtatcag catgatggta gcagaaacct tatcaccaag 60
gtgcaggagc tgacttcttc caaagagttg tggttccggg cagcggtcat tgccgtgccc 120
attgctggag ggctgatttt agtgttgctt attatgttgg ccctgaggat gcttcgaagt 180
gaaaataaga ggctgcagga tcagcggcaa cagatgctct cccgtttgca ctacagcttt 240
cacggacacc attccaaaaa ggggcaggtt gcaaagttag acttggaatg catggtgccg 300
gtcagtgggc acgagaactg ctgtctgacc tgtgataaaa tgagacaagc agacctcagc 360
aacgataaga tcctctcgct tgt 383



<210> 456 
<211> 543 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 64 

<223> n = A,T,C or G 
<400> 456

acaaacattt tacaaaaaag aacattacca atatcagtgg cagtaagggc aagctgaaga 60
atangtagac tgagtttccg ggcaatgtct gtcctcaaag acatccaaac tgcgttcagg 120
cagctgaaac aggcttcttt cccagtgaca agcatatgtg gtcagtaata caaacgatgg 180
taaatgaggc tactacatag gcccagttaa caaactcctc ttctcctcgg gtaggccatg 240
atacaagtgg aactcatcaa ataatttaaa cccaaggcga taacaacact atttcccatc 300
taaactcatt taagccttca caatgtcgca atggattcag ttacttgcaa acgatcccgg 360
gttgtcatac agatacttgt tttttacaca taacgctgtg ccatcccttc cttcactgcc 420
ccagtcaggt ttcctgttgt tggaccgaaa ggggatacat tttagaaatg cttccctcaa 480
gacagaagtg agaaagaaag gagaccctga ggccaggatc tattaaacct ggtgtgtgcg 540
caa 543



<210> 457 
<211> 544 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 17 

<223> n = A,T,C or G 



<400> 457

actggtgcca atattgncat ggtgagctcc tctctaatgt cttccagggc accaatatct 60
gcccatgtca cattagggac agtgacaaag ccttcccttt tggcagaggg ttggactgag 120
gatagagcaa caatgaaatc attcagttca atgcacagtc cttgcatctg ctcctctgag 180
aggggatctt ggtctcttag caaccccagc agcctttgta attcatcctg tgtttcagaa 240
gtgggctcag ttcccagcct ttcctcctgg actcctttag atggcaaatc ttccatttca 300
ggatttttct tctgctgttc ctgtagcttc attaagactc tattgactgc acacattgct 360
gcctctcggc acagtgccat gagatcagca ccaacaaagc ctggagttag gtgtgctaag 420
tgacagaaat caaaagcttg aggaagcctc agttttctgc acaatgtttg aagtattctt 480
tccctggatg cttcatctgg gatacctagg catatttctc ggtcgaacct tcccgcacgt 540
ctca 544



<210> 458 
<211> 382 
<212> DNA 
<213> Homo sapiens

<220>
<221> misc_feature
<222> 5, 23
<223> n = A,T,C or G

<400> 458

acctntaggc tcaacggcag aancttcacc acaaaagcga aatgggcaca ccacagggag 60
aaaactggtt gtcctggatg tttgaaaagt tggtcgttgt catggtgtgt tacttcatcc 120
tatctatcat taactccatg gcacaaagtt atgccaaacg aatccagcag cggttgaact 180
cagaggagaa aactaaataa gtagagaaag ttttaaactg cagaaattgg agtggatggg 240
ttctgcctta aattgggagg actccaagcc gggaaggaaa attccctttt ccaacctgta 300
tcaattttta caactttttt cctgaaagca gtttagtcca tactttgcac tgacatactt 360
tttccttctg tgctaaggta ag 382

<210> 459
<211> 168
<212> DNA
<213> Homo sapiens

<400> 459

ctcgtactct agccaggcac gaaaccatga agtagcctga tccttcttag ccatcctggc 60
cgccttagcg gtagtaactt tgtgttatga atcacatgaa agcatggaat cttatgaact 120
taatcccttc attaacagga gaaatgcaaa taccttcata tcccctca 168



<210> 460 
<211> 190 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 4 

<223> n = A,T,C or G 



<400> 460

acanctgcta ccagggagcc gagagctgac tatcccagcc tcggctaatg tattctacgc 60
catggatgga gcttcacacg atttcctcct gcggcagcgg cgaaggtcct ctactgctac 120
acctggcgtc accagtggcc cgtctgcctc aggaactcct ccgagtgagg gaggaggggg 180
ctcctttccc 190



<210> 461 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<400> 461

acagacaggc ttctctgcta tcctccaggc agtgtaatag tcaaggaaaa gggcaacagt 60
attggatcat tccttagaca ctaatcagct ggggaaagag ttcattggca aaagtgtcct 120
cccaagaatg gtttacacca agcagagagg acatgtcact gaatggggaa agggaacccc 180
cgtatccaca gtcactgtaa gcatccagta ggcaggaaga tggctttggg cagtggctgg 240
atgaaagcag atttgagata cccagctccg gaacgaggtc atcttctaca ggttcttcct 300
tcactgagac aatgaattca gggtgatcat tctctgaggg gctgagaggt gcttcctcga 360
ttttcactac cacattagct tggctctctg tctcagaggg tatctctaag actaggggct 420
tggtatatat gtggtcaaaa cgaattagtt cattaatggc ttccagcttg gctgatgacg 480
tccccactga cagag 495

<210> 462
<211> 493
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 68
<223> n = A,T,C or G

<400> 462

acactgaaac ataaatccgc aagtcaccac acatacaaca cccggcagga aaaaaacaaa 60
aacagggngt ttacatgatc cctgtaacag ccatggtctc aaactcagat gcttcctcca 120
tctgccaagt gtgttttgga tacagagcac atcgtggctt ctggggtcac actcagctta 180
ggctgtgggt ccacagagca ctcatctggc tgggctatgg tggtggtggc tctactcaag 240
aagcaaagca gttaccagca cattcaaaca gtgtattgaa catcttttaa atatcaaagt 300
gagaaacaag aaggcaacat aataatgtta tcagaaagat gttaggaagt aaggacagct 360
gtgtaaagct tgaggctgaa aagtagcttg ccagcttcat ttctttggtt tcttgggtag 420
tgggcgccgg aacagcaaga tgtgaggttc tggttcatgg atcatataat ggacccatcc 480
ctgactctgc tga 493

<210> 463
<211> 3681
<212> DNA
<213> Homo sapiens



<400> 463 

tccgagctga ttacagacac caaggaagat 
ctagctggcc ctgtgggcat ttattagtaa 
cacccgtggg taattaacct ggtcatcccc 
tcaaagaagg aacatctgca ggaacacctg 
ctgacacagc tgaaagcttg gtggaaaaaa 
gaacacctga cacggctgaa agcttggtgg 
tggagggaac atctgacaaa attcaatgtt 
agtcagcaga agaaacacct agggaaatta 
ttacgtggcc agcaaaagga agacctagga 
ctagggaaat tatgagtccc gcaaaagaaa 
gaagacctag gaagatcgca tgggagaaaa 
caagagtaac atctaataaa actaaagttt 
gtcctacaaa agaatcatct acaaaagcaa 
ccaaacaaga ggaagatgaa gaatattctt 
caaagattca agtgtgtata cctgagtcta 
aagtagaaga gcctcctaag aagccatctg 
ctgttccaaa taaagccttt gaattgaaga 
tcccaccaga atccaaacaa aaggactatg 
gtgagactgt ttcacagaag gatgtgtgtt 
ataaaataaa tggaaaatta gaagagtctc 
gcggaatgaa agtttctatt ccaactaaag 
aagcagagcc tccggggaag ccatctgcct 
tcccaaataa agccttggaa ttgaaaaatg 
catcagaatc caaacaaaag gactatgaag
agactgtttc acagaaggat gtgtgtttac 
aaataaatgg aaaattagaa gggtctcctg 
gaatgaaagt ttctattcca actaaagcct 
cagagcctcc cgagaagcca tctgccttcg 
caaataaagc cttggaattg aagaatgaac 
cagaatccaa acaaaaggac tatgaagaaa 
ctgtttcaca gaaggatgtg tgtttaccca 



gctgtaaaga gtcagcagcc acagccctgg 60 
agttttaatg acaaaagctt tgagtcaaca 120 
accctggaga gccatcctgc ccatgggtga 180 
atgaggctgc acccttggcg gaaagaacac 240 
cacctgatga ggctgcaccc ttggtggaaa 300 
aaaaaacacc tgatgaggct gcatccttgg 360 
tggagaaagc gacatctgga aagttcgaac 420 
cgagtcctgc aaaagaaaca tctgagaaat 480 
agatcgcatg ggagaaaaaa gaagacacac 540 
catctgagaa atttacgtgg gcagcaaaag 600 
aagaaacacc tgtaaagact ggatgcgtgg 660 
tggaaaaagg aagatctaag atgattgcat 720 
gtgccaatga tcagaggttc ccatcagaat 780 
gtgattctcg gagtctcttt gagagttctg 840 
tatatcaaaa agtaatggag ataaatagag 900 
ccttcaagcc tgccattgaa atgcaaaact 960 
atgaacaaac attgagagca gatccgatgt 1020 
aagaaaattc ttgggattct gagagtctct 1080 
tacccaaggc tacacatcaa aaagaaatag 1140 
ctaataaaga tggtcttctg aaggctacct 1200 
ccttagaatt gaaggacatg caaactttca 1260 
tcgagcctgc cactgaaatg caaaagtctg 1320 
aacaaacatt gagagcagat gagatactcc 1380 
aaagttcttg ggattctgag agtctctgtg 1440 
ccaaggctrc rcatcaaaaa gaaatagata 1500 
ttaaagatgg tcttctgaag gctaactgcg 1560 
tagaattgat ggacatgcaa actttcaaag 1620 
agcctgccat tgaaatgcaa aagtctgttc 1680 
aaacattgag agcagatgag atactcccat 1740 
gttcttggga ttctgagagt ctctgtgaga 1800 
aggctrcrca tcaaaaagaa atagataaaa 1860 
taaatggaaa attagaagag tctcctgata
tgaaagtttc tattccaact aaagccttag 
agcctcccga gaagccatct gccttcgagc 
ataaagcctt ggaattgaag aatgaacaaa 
aatcaaaaca aaagaasgtt gaagaaaatt 
tttcacagaa ggatgtgtgt gtacccaagg 
gtggaaaatt agaagattca actagcctat 
aaagagcaag ggaacttcaa aaagatcact 
tgaaaaagaa gttttgtgta ctgaaaaaga 
agttagagaa ccaaaaagtt aaatgggaac 
tcatgaaaat gaaaattatc tcttacatga 
gctaaaactg gaaatagcca cactgaaaca 
tgaggacatt aagattttaa aagaaaagaa 
agaggaatca ttaactaaaa gggcatctca 
tgagaacaca atgctcactt ctaaattgaa 
agaaattgaa tcacaccatc ctagactggc 
gacatcaaga aaaagtcaag aacctgcttt 
aaaaatgaat gttgatgtga gtagtacgat 
ttctgaagct caaaggaaat ccaaaagcct 
tctaagagaa aatacattgg tttcagaaca 
tcaaatgaag gaagctgaac acatgtatca 
tgaacagcag gagtctctag atcagaaatt 
tcaacagcaa ttagttcatg cacataagaa 
tattcatttt cttgagagga aaatgcaaca 
atttaattac aataaccatt taaaaaaccg 
aacagaaaac tcatgagaga caagcagtaa 
atctttactc acaactcatg ctaggaggcc 
taccaatagt ctgtgtcaac agaatactta 
gaagcctaca gacataaaat aacagtgtga 
tgcacaggat tcccatctac cctgatgatg 
tcgctctgtc actcaggctg g 

<210> 464 

<211> 1424 

<212> DNA 

<213> Homo sapiens 

<400> 464 

tccgagctga ttacagacac caaggaagat 
ctagctggcc ctgtgggcat ttattagtaa 
cacccgtggg taattaacct ggtcatcccc 
tcaaagaagg aacatctgca ggaacacctg 
ctgacacagc tgaaagcttg gtggaaaaaa 
gaacacctga cacggctgaa agcttggtgg 
tggagggaac atctgacaaa attcaatgtt 
agtcagcaga agaaacacct agggaaatta 
ttacgtggcc agcaaaagga agacctagga 
ctagggaaat tatgagtccc gcaaaagaaa 
gaagacctag gaagatcgca tgggagaaaa 
caagagtaac atctaataaa actaaagttt 
gtcctacaaa agaatcatct acaaaagcaa 
ccaaacaaga ggaagatgaa gaatattctt 
caaagattca agtgtgtata cctgagtcta 
aagtagaaga gcctcctaag aagccatctg 
ctgttccaaa taaagccttt gaattgaaga 
tcccaccaga atccaaacaa aaggactatg 
gtgagactgt ttcacagaag gatgtgtgtt 
ataaaataaa tggaaaatta gaaggtaaga 



atgatggttt tctgaaggct ccctgcagaa 1920 
aattgatgga catgcaaact ttcaaagcag 1980 
ctgccattga aatgcaaaag tctgttccaa 2040 
cattgagagc agatcagatg ttcccttcag 2100 
cttgggattc tgagagtctc cgtgagactg 2160 
ctacacatca aaaagaaatg gataaaataa 2220 
caaaaatctt ggatacagtt cattcttgtg 2280 
gtgaacaacg tacaggaaaa atggaacaaa 2340 
aactgtcaga agcaaaagaa ataaaatcac 2400 
aagagctctg cagtgtgagg tttctcacac 2460 
aaattgcatg ttgaaaaagg aaattgccat 2520 
ccaataccag gaaaaggaaa ataaatactt 2580 
tgctgaactt cagatgaccc taaaactgaa 2640 
atatagtggg cagcttaaag ttctgatagc 2700 
ggaaaaacaa gacaaagaaa tactagaggc 2760 
ttctgctgta caagaccatg atcaaattgt 2820 
ccacattgca ggagatgctt gtttgcaaag 2880 
atataacaat gaggtgctcc atcaaccact 2940 
aaaaattaat ctcaattatg cmggagatgc 3000 
tgcacaaaga gaccaacgtg aaacacagtg 3060 
aaacgaacaa gataatgtga acaaacacac 3120 
atttcaacta caaagcaaaa atatgtggct 3180 
agctgacaac aaaagcaaga taacaattga 3240 
tcatctccta aaagagaaaa atgaggagat 3300 
tatatatcaa tatgaaaaag agaaagcaga 3360 
gaaacttctt ttggagaaac aacagaccag 3420 
agtcctagca tcaccttatg ttgaaaatct 3480 
ttttagaaga aaaattcatg atttcttcct 3540 
agaattactt gttcacgaat tgcataaagc 3600 
cagcagacat cattcaatcc aaccagaatc 3660 

3681 



gctgtaaaga gtcagcagcc acagccctgg 60 
agttttaatg acaaaagctt tgagtcaaca 120 
accctggaga gccatcctgc ccatgggtga 180 
atgaggctgc acccttggcg gaaagaacac 240 
cacctgatga ggctgcaccc ttggtggaaa 300 
aaaaaacacc tgatgaggct gcatccttgg 360 
tggagaaagc gacatctgga aagttcgaac 420 
cgagtcctgc aaaagaaaca tctgagaaat 480 
agatcgcatg ggagaaaaaa gaagacacac 540 
catctgagaa atttacgtgg gcagcaaaag 600 
aagaaacacc tgtaaagact ggatgcgtgg 660 
tggaaaaagg aagatctaag atgattgcat 720 
gtgccaatga tcagaggttc ccatcagaat 780 
gtgattctcg gagtctcttt gagagttctg 840 
tatatcaaaa agtaatggag ataaatagag 900 
ccttcaagcc tgccattgaa atgcaaaact 960 
atgaacaaac attgagagca gatccgatgt 1020 
aagaaaattc ttgggattct gagagtctct 1080 
tacccaaggc tacacatcaa aaagaaatag 1140 
accgtttttt atttaaaaat cagttgaccg 1200 
aatatttctc taaactgatg aggagggata

aaatgcaaac catggaaaaa aagagaagtg 

tgttggcaac agactatatt gagagtgctg 

agatattgca agacctgaga gaaaaaaaaa 

<210> 465 
<211> 674 
<212> DNA 
<213> Homo sapiens 



tcctctagta gctgaagaaa attacctcct 1260 

caatggtcgt aagttgtatg tctcatcagg 1320 

aaaaggagct gaattattag tttgaattca 1380 

aaaaaaaaaa aaaa 1424 



<400> 465 

attccgagct gattacagac accaaggaag 
ggctagctgg ccctgtgggc atttattagt 
cacacccgtg ggtaattaac ctggtcatcc 
gatcaaagaa ggaacatctg caggaacacc 
acctgacaca gctgaaagct tggtggaaaa 
aagaacacct gacacggctg aaagcttggt 
ggtggaggga acatctgaca aaattcaatg 
acagtcagca gaagaaacac ctagggaaat 
atttacgtgg ccagcaaaag gaagacctag 
agttaaggca aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaa 

<210> 466 

<211> 1729 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> 11, 1128 
<223> n = A,T,C or G 



atgctgtaaa gagtcagcag ccacagccct 60
aaagttttaa tgacaaaagc tttgagtcaa 120 
ccaccctgga gagccatcct gcccatgggt 180 
tgatgaggct gcacccttgg cggaaagaac 240 
aacacctgat gaggctgcac ccttggtgga 300 
ggaaaaaaca cctgatgagg ctgcatcctt 360 
tttggagaaa gcgacatctg gaaagttcga 420 
tacgagtcct gcaaaagaaa catctgagaa 480 
gaagatcgca tgggagaaaa aagatgactc 540 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 600 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 660 

674 



<400> 466 

gaaagttcga ncagtcagca gaagaaacac 
catctgagaa atttacgtgg ccagcaaaag 
aagaagacac acctagggaa attatgagtc 
gggcagcaaa aggaagacct aggaagatcg 
ctggatgcgt ggcaagagta acatctaata 
agatgattgc atgtcctaca aaagaatcat 
tcccatcaga atccaaacaa gaggaagatg 
ttgagagttc tgcaaagatt caagtgtgta 
agataaatag agaagtagaa gagcctccta 
aaatgcaaaa ctctgttcca aataaagcct 
cagatccgat gttcccacca gaatccaaac 
ctgagagtct ctgtgagact gtttcacaga 
aaaaagaaat agataaaata aatggaaaat 
tgaaggctac ctgcggaatg aaagtttcta 
tgcaaacttt caaagcagag cctccgggga 
tgcaaaagtc tgtcccaaat aaagccttgg 
atgagatact cccatcagaa tccaaacaaa 
agagtctctg tgagactgtt tcacagaagg 
aagaaataga taaaataaat ggaaaattag 
aggctaactg cggaatgaaa gtttctattc 
aaactttcaa agcagagcct cccgagaagc 
aaaagtctgt tccaaataaa gccttggaat 
agatactccc atcagaatcc aaacaaaagg 



ctagggaaat tacgagtcct gcaaaagaaa 60 
gaagacctag gaagatcgca tgggagaaaa 120 
ccgcaaaaga aacatctgag aaatttacgt 180 
catgggagaa aaaagaaaca cctgtaaaga 240 
aaactaaagt tttggaaaaa ggaagatcta 300 
ctacaaaagc aagtgccaat gatcagaggt 360 
aagaatattc ttgtgattct cggagtctct 420 
tacctgagtc tatatatcaa aaagtaatgg 480 
agaagccatc tgccttcaag cctgccattg 540 
ttgaattgaa gaatgaacaa acattgagag 600 
aaaaggacta tgaagaaaat tcttgggatt 660 
aggatgtgtg tttacccaag gctacacatc 720 
tagaagagtc tcctaataaa gatggtcttc 780 
ttccaactaa agccttagaa ttgaaggaca 840 
agccatctgc cttcgagcct gccactgaaa 900 
aattgaaaaa tgaacaaaca ttgagagcag 960 
aggactatga agaaaattct tgggatactg 1020 
atgtgtgttt acccaaggct gcgcatcaaa 1080 
aagggtctcc tggtaaanat ggtcttctga 1140 
caactaaagc cttagaattg atggacatgc 1200 
catctgcctt cgagcctgcc attgaaatgc 1260 
tgaagaatga acaaacattg agagcagatg 1320 
actatgaaga aagttcttgg gattctgaga 1380 
gtctctgtga gactgtttca cagaaggatg
aaatagataa aataaatgga aaattagaag 
tgaccaaata tttctctaaa ttgatgagga 
cctcctaaat gcaaaccatg gaaaaaaaga 
atcaggcatt ggcaacagac tatattgtga 
aattcaagat attccaagac gtgaggaaaa 

<210> 467 

<211> 1337 

<212> DNA 

<213> Homo sapiens 



tgtgtttacc caaggctgcg catcaaaaag 1440 

gtaagaaccg ttttttattt aaaaatcatt 1500 

aggatatcct ctagtagctg aagaaaatta 1560 

gaagtgcaat ggtcataagc tatgtgtctc 1620 

gtgctgaaga ggagctgaat tactagttta 1680 

tgagaaaaaa aaaaaaaaa 1729 



<400> 467 

aaaaagaaat agataaaata aatggaaaat 
tgaaggctaa ctgcggaatg aaagtttcta 
tgcaaacttt caaagcagag cctcccgaga 
tgcaaaagtc tgttccaaat aaagccttgg 
atgagatact cccatcagaa tccaaacaaa
agagtctctg tgagactgtt tcacagaagg 
aagaaataga taaaataaat ggaaaattag
aggctccctg cagaatgaaa gtttctattc 
aaactttcaa agcagagcct cccgagaagc 
aaaagtctgt tccaaataaa gccttggaat 
agatgttccc ttcagaatca aaacaaaaga 
gtctccgtga gactgtttca cagaaggatg 
aaatggataa aataagtgga aaattagaag 
cagttcattc ttgtgaaaga gcaagggaac 
gaaaaatgga acaaatgaaa aagaagtttt 
aagaaataaa atcacagtta gagaaccaaa 
tgagattgac tttaaaccaa gaagaagaga 
aaattaggga agaattagga agaatcgaag 
aacaacttga acaggctctc agaatacaag 
tgaatcaggt ttctcacact catgaaaatg 
tgaaaaagga aattgccatg ctaaaactgg 
aaaaggaaaa taaatacttt gaggacatta 
agatgacccc tcgtgcc 

<210> 468 
<211> 2307 
<212> DNA

<213> Homo sapiens 



tagaagggtc tcctgttaaa gatggtcttc 60 

ttccaactaa agccttagaa ttgatggaca 120 

agccatctgc cttcgagcct gccattgaaa 180 

aattgaagaa tgaacaaaca ttgagagcag 240 

aggactatga agaaagttct tgggattctg 300 

atgtgtgttt acccaaggct gcgcatcaaa 360 

aagagtctcc tgataatgat ggttttctga 420 

caactaaagc cttagaattg atggacatgc 480 

catctgcctt cgagcctgcc attgaaatgc 540 

tgaagaatga acaaacattg agagcagatc 600 

aggttgaaga aaattcttgg gattctgaga 660 

tgtgtgtacc caaggctaca catcaaaaag 720 

attcaactag cctatcaaaa atcttggata 780 

ttcaaaaaga tcactgtgaa caacgtacag 840 

gtgtactgaa aaagaaactg tcagaagcaa 900 

aagttaaatg ggaacaagag ctctgcagtg 960 

agagaagaaa tgccgatata ttaaatgaaa 1020 

agcagcatag gaaagagtta gaagtgaaac 1080 

atatagaatt gaagagtgta gaaagtaatt 1140 

aaaattatct cttacatgaa aattgcatgt 1200 

aaatagccac actgaaacac caataccagg 1260 

agattttaaa agaaaagaat gctgaacttc 1320 

1337 



<400> 468 

attgagagca gatgagatac tcccatcaga 
ttgggattct gagagtctct gtgagactgt 
tacacatcaa aaagaaatag ataaaataaa 
tggtcttctg aaggctaact gcggaatgaa 
gatggacatg caaactttca aagcagagcc 
cattgaaatg caaaagtctg ttccaaataa 
gagagcagat gagatactcc catcagaatc 
ggattctgag agtctctgtg agactgtttc 
acatcaaaaa gaaatagata aaataaatgg 
ttttctgaag tctccctgca gaatgaaagt 
ggacatgcaa actttcaaag cagagcctcc 
tgaaatgcaa aagtctgttc caaataaagc 
agcagatcag atgttccctt cagaatcaaa 
ttctgagagt ctccgtgaga ctgtttcaca 
tcaaaaagaa atggataaaa taagtggaaa 



atccaaacaa aaggactatg aagaaagttc 60 
ttcacagaag gatgtgtgtt tacccaaggc 120 
tggaaaatta gaagggtctc ctgttaaaga 180 
agtttctatt ccaactaaag ccttagaatt 240 
tcccgagaag ccatctgcct tcgagcctgc 300 
agccttggaa ttgaagaatg aacaaacatt 360 
caaacaaaag gactatgaag aaagttcttg 420 
acagaaggat gtgtgtttac ccaaggctac 480 
aaaattagaa gagtctcctg ataatgatgg 540 
ttctattcca actaaagcct tagaattgat 600 
cgagaagcca tctgccttcg agcctgccat 660 
cttggaattg aagaatgaac aaacattgag 720 
acaaaagaac gttgaagaaa attcttggga 780 
gaaggatgtg tgtgtaccca aggctacaca 840 
attagaagat tcaactagcc tatcaaaaat 900 
cttggataca gttcattctt gtgaaagagc aagggaactt caaaaagatc actgtgaaca 960
acgtacagga aaaatggaac aaatgaaaaa gaagttttgt gtactgaaaa agaaactgtc 1020 
agaagcaaaa gaaataaaat cacagttaga gaaccaaaaa gttaaatggg aacaagagct 1080 
ctgcagtgtg aggtttctca cactcatgaa aatgaaaatt atctcttaca tgaaaattgc 1140 
atgttgaaaa aggaaattgc catgctaaaa ctggaaatag ccacactgaa acaccaatac 1200 
caggaaaagg aaaataaata ctttgaggac attaagattt taaaagaaaa gaatgctgaa 1260 
cttcagatga ccctaaaact gaaagaggaa tcattaacta aaagggcatc tcaatatagt 1320 
gggcagctta aagttctgat agctgagaac acaatgctca cttctaaatt gaaggaaaaa 1380 
caagacaaag aaatactaga ggcagaaatt gaatcacacc atcctagact ggcttctgct 1440
gtacaagacc atgatcaaat tgtgacatca agaaaaagtc aagaacctgc tttccacatt 1500 
gcaggagatg cttgtttgca aagaaaaatg aatgttgatg tgagtagtac gatatataac 1560 
aatgaggtgc tccatcaacc actttctgaa gctcaaagga aatccaaaag cctaaaaatt 1620 
aatctcaatt atgcaggaga tgctctaaga gaaaatacat tggtttcaga acatgcacaa 1680 
agagaccaac gtgaaacaca gtgtcaaatg aaggaagctg aacacatgta tcaaaacgaa 1740 
caagataatg tgaacaaaca cactgaacag caggagtctc tagatcagaa attatttcaa 1800 
ctacaaagca aaaatatgtg gcttcaacag caattagttc atgcacataa gaaagctgac 1860 
aacaaaagca agataacaat tgatattcat tttcttgaga ggaaaatgca acatcatctc 1920 
ctaaaagaga aaaatgagga gatatttaat tacaataacc atttaaaaaa ccgtatatat 1980 
caatatgaaa aagagaaagc agaaacagaa aactcatgag agacaagcag taagaaactt 2040 
cttttggaga aacaacagac cagatcttta ctcacaactc atgctaggag gccagtccta 2100 
gcatcacctt atgttgaaaa tcttaccaat agtctgtgtc aacagaatac ttattttaga 2160 
agaaaaattc atgatttctt cctgaagcct acagacataa aataacagtg tgaagaatta 2220 
cttgttcacg aattgcataa agctgcacag gattcccatc taccctgatg atgcagcaga 2280 
catcattcaa tccaaccaga atctcgc 2307 

<210> 469 
<211> 650 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> VARIANT 

<222> 310, 429, 522 

<223> Xaa = Any Amino Acid 



<400> 469 



Met 


Ser 


Pro 


Ala 


Lys 


Glu 


Thr 


Ser Glu Lys Phe Thr Trp Ala 


Ala 


Lys 


1 








5 






10 


15 




Gly 


Arg 


Pro 


Arg 


Lys 


Ile


Ala 


Trp Glu Lys Lys Glu Thr Pro 


Val 


Lys 








20 








25 30 






Thr 


Gly 


Cys 


Val 


Ala 


Arg 


Val 


Thr Ser Asn Lys Thr Lys Val 


Leu 


Glu 






35 










40 45 






Lys 


Gly 


Arg 


Ser 


Lys 


Met 


Ile


Ala Cys Pro Thr Lys Glu Ser 


Ser 


Thr 




50 










55 


60 






Lys 


Ala 


Ser 


Ala 


Asn


Asp 


Gln


Arg Phe Pro Ser Glu Ser Lys 


Gln


Glu 


65 










70 




75 




80 


Glu 


Asp 


Glu 


Glu 


Tyr 


Ser 


Cys 


Asp Ser Arg Ser Leu Phe Glu 


Ser 


Ser 










85 






90 


95 




Ala 


Lys 


Ile


Gln


Val 


Cys 


Ile


Pro Glu Ser Ile Tyr Gln Lys Val Met








100 








105 110 






Glu 


Ile


Asn 


Arg 


Glu 


Val 


Glu 


Glu Pro Pro Lys Lys Pro Ser 


Ala 


Phe 






115 










120 125 






Lys 


Pro 


Ala 


Ile


Glu 


Met 


Gln


Asn Ser Val Pro Asn Lys Ala 


Phe 


Glu 


130 










135 


140 






Leu 


Lys 


Asn 


Glu 


Gln


Thr 


Leu 


Arg Ala Asp Pro Met Phe Pro 


Pro 


Glu 


145 










150 




155 




160 


Ser 


Lys 


Gln


Lys 


Asp 


Tyr 


Glu 


Glu Asn Ser Trp Asp Ser Glu 


Ser 


Leu 










165 






170 


175 
Cys


Glu 


Thr 


Val 


Ser 


Gln Lys


Asp Val Cys 








180 






185 


Gln


Lys 


Glu 


Ile Asp Lys Ile


Asn Gly Lys 






195 








200 


Lys 


Asp 


Gly 


Leu Leu Lys Ala 


Thr Cys Gly 




210 








215 




Thr 


Lys 


Ala 


Leu 


Glu 


Leu Lys 


Asp Met Gln


225 










230 




Pro 


Gly 


Lys 


Pro 


Ser 


Ala Phe 


Glu Pro Ala 










245 




250 


Val 


Pro 


Asn 


Lys 


Ala 


Leu Glu 


Leu Lys Asn 








260 






265 


Asp 


Glu 


Ile


Leu 


Pro 


Ser Glu 


Ser Lys Gln






275 








280 


Ser 


Trp 


Asp 


Ser 


Glu 


Ser Leu 


Cys Glu Thr 




290 








295 




Cys 


Leu 


Pro 


Lys Ala Xaa His 


Gln Lys Glu


305 










310 




Lys 


Leu 


Glu 


Gly Ser Pro Val 


Lys Asp Gly 










325 




330 


Gly 


Met 


Lys 


Val 


Ser 


Ile Pro


Thr Lys Ala 








340 






345 


Gln


Thr 


Phe 


Lys 


Ala 


Glu Pro 


Pro Glu Lys 






355 








360 


Ala 


Ile


Glu 


Met 


Gln


Lys Ser 


Val Pro Asn 




370 








375 




Asn 


Glu 


Gln


Thr Leu Arg Ala 


Asp Glu Ile


385 










390 




Gln


Lys


Asp 


Tyr 


Glu 


Glu Ser 


Ser Trp Asp 










405 




410 


Thr 


Val 


Ser 


Gln Lys Asp Val


Cys Leu Pro 








420 






425 


Glu 


Ile


Asp 


Lys 


Ile Asn Gly


Lys Leu Glu 






435 








440 


Gly 


Phe 


Leu 


Lys 


Ala 


Pro Cys 


Arg Met Lys 




450 








455 




Ala 


Leu 


Glu 


Leu 


Met 


Asp Met 


Gln Thr Phe


465 










470 




Lys 


Pro 


Ser 


Ala 


Phe 


Glu Pro 


Ala Ile Glu










485 




490 


Asn 


Lys 


Ala 


Leu 


Glu 


Leu Lys 


Asn Glu Gln








500 






505 


Met 


Phe 


Pro 


Ser 


Glu 


Ser Lys 


Gln Lys Xaa






515 








520 


Asp 


Ser 


Glu 


Ser Leu Arg Glu 


Thr Val Ser 




530 








535 




Pro 


Lys 


Ala 


Thr 


His 


Gln Lys


Glu Met Asp 


545 










550 




Glu 


Asp 


Ser 


Thr 


Ser 


Leu Ser 


Lys Ile Leu










565 




570 


Glu 


Arg 


Ala 


Arg 


Glu 


Leu Gln


Lys Asp His 








580 






585 


Lys 


Met 


Glu 


Gln


Met 


Lys Lys 


Lys Phe Cys 






595 








600 


Ser 


Glu 


Ala 


Lys 


Glu 


Ile Lys


Ser Gln Leu




610 








615 




Trp 


Glu 


Gln


Glu 


Leu 


Cys Ser 


Val Arg Phe 


625 










630 
Leu


Pro 


Lys 


Ala 
190 


Thr 


His 


Leu 


Glu 


Glu 
205 


Ser 


Pro 


Asn 


Met 


Lys 
220 


Val 


Ser 


Ile


Pro 


Thr 


Phe 


Lys 


Ala 


Glu 


Pro 


235 










240 


Thr 


Glu 


Met 


Gln


Lys
255 


Ser 


Glu 


Gln


Thr 


Leu 
270 


Arg


Ala 


Lys 


Asp 


Tyr 
285 


Glu 


Glu 


Ser 


Val 


Ser 
300 


Gln


Lys


Asp 


Val 


Ile


Asp 


Lys


Ile


Asn 


Gly


315 










320 


Leu 


Leu 


Lys 


Ala 


Asn 
335 


Cys 


Leu 


Glu 


Leu 


Met 
350 


Asp 


Met 


Pro 


Ser 


Ala 
365 


Phe 


Glu 


Pro 


Lys 


Ala 
380 


Leu 


Glu 


Leu 


Lys 


Leu 


Pro 


Ser 


Glu 


Ser 


Lys 


395 










400 


Ser 


Glu 


Ser 


Leu 


Cys 
415 


Glu 


Lys 


Ala 


Xaa 


His 
430 


Gln


Lys 


Glu 


Ser 


Pro 
445 


Asp 


Asn 


Asp 


Val 


Ser 
460 


Ile


Pro 


Thr 


Lys


Lys


Ala 


Glu 


Pro 


Pro 


Glu 


475 










480 


Met 


Gln


Lys


Ser 


Val 
495 


Pro 


Thr 


Leu 


Arg


Ala 
510 


Asp


Gln


Val 


Glu 


Glu 
525 


Asn 


Ser 


Trp


Gln


Lys 
540 


Asp 


Val 


Cys 


Val 


Lys 


Ile


Ser 


Gly 


Lys 


Leu 


555 










560 


Asp 


Thr 


Val 


His 


Ser 
575 


Cys 


Cys 


Glu 


Gln


Arg 
590 


Thr 


Gly 


Val


Leu 


Lys 
605 


Lys 


Lys 


Leu 


Glu 


Asn 
620 


Gln


Lys 


Val 


Lys 


Leu 


Thr 


Leu 


Met 


Lys 


Met 



635 640 
Lys Ile Ile Ser Tyr Met Lys Ile Ala Cys
645 650 



<210> 470 
<211> 228 
<212> PRT 
<213> Homo sapiens 



<400> 470 

Met Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys 

1 5 10 15

Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Thr Pro Val Lys

20 25 30 

Thr Gly Cys Val Ala Arg Val Thr Ser Asn Lys Thr Lys Val Leu Glu 

35 40 45 

Lys Gly Arg Ser Lys Met Ile Ala Cys Pro Thr Lys Glu Ser Ser Thr

50 55 60 

Lys Ala Ser Ala Asn Asp Gln Arg Phe Pro Ser Glu Ser Lys Gln Glu

65 70 75 80

Glu Asp Glu Glu Tyr Ser Cys Asp Ser Arg Ser Leu Phe Glu Ser Ser 

85 90 95 

Ala Lys Ile Gln Val Cys Ile Pro Glu Ser Ile Tyr Gln Lys Val Met

100 105 110 

Glu Ile Asn Arg Glu Val Glu Glu Pro Pro Lys Lys Pro Ser Ala Phe

115 120 125 

Lys Pro Ala Ile Glu Met Gln Asn Ser Val Pro Asn Lys Ala Phe Glu

130 135 140
Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Pro Met Phe Pro Pro Glu
145 150 155 160 

Ser Lys Gln Lys Asp Tyr Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu

165 170 175 

Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala Thr His

180 185 190 

Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Gly Lys Asn Arg

195 200 205

Phe Leu Phe Lys Asn Gln Leu Thr Glu Tyr Phe Ser Lys Leu Met Arg

210 215 220 

Arg Asp Ile Leu
225 



<210> 471 

<211> 154 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 
<222> 148 

<223> Xaa = Any Amino Acid 
<400> 471 

Met Arg Leu His Pro Trp Arg Lys Glu His Leu Thr Gln Leu Lys Ala

1 5 10 15

Trp Trp Lys Lys His Leu Met Arg Leu His Pro Trp Trp Lys Glu His 

20 25 30 

Leu Thr Arg Leu Lys Ala Trp Trp Lys Lys His Leu Met Arg Leu His 
35 40


Pro 


Trp 


Trp


Arg Glu His Leu Thr










His


Leu 




Ser Ser Asn Ser Gln


65 






70 


Arg 


Val 


Leu 


Gln Lys Lys His Leu








85 


Glu 


Asp 


Leu 


Gly Arg Ser His Gly 








100 


Gln


Lys 


Lys 


Lys Lys Lys Lys Lys 






115 


120 


Lys 


Lys 


Lys 


Lys Lys Lys Lys Lys 




130 




135 


Lys 


Lys 


Lys 


Xaa Lys Lys Lys Lys 


145 






150 



45 

Lys Phe Asn Val Trp Arg Lys Arg 
60 

Gln Lys Lys His Leu Gly Lys Leu

75 80 
Arg Asn Leu Arg Gly Gln Gln Lys

90 95 
Arg Lys Lys Met Thr Gln Leu Arg
105 110 
Lys Lys Lys Lys Lys Lys Lys Lys 
125 

Lys Lys Lys Lys Lys Lys Lys Lys 
140 

Lys Lys 



<210> 472 
<211> 466 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> VARIANT 
<222> 329 

<223> Xaa = Any Amino Acid 



<400> 472 



Met 


Ser 


Pro 


Ala 


Lys 


Glu 


Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys 


1 








5 




10 






15 




Gly 


Arg 


Pro 


Arg 


Lys 


Ile


Ala Trp Glu Lys Lys 


Glu Thr 


Pro 


Val 


Lys 






20 






25 




30 






Thr 


Gly 


Cys 


Val 


Ala 


Arg 


Val Thr Ser Asn Lys 


Thr Lys 


Val 


Leu 


Glu 






35 








40 


45 








Lys 


Gly 


Arg 


Ser 


Lys 


Met 


Ile Ala Cys Pro Thr


Lys Glu 


Ser 


Ser 


Thr 




50 










55 


60 








Lys 


Ala 


Ser 


Ala 


Asn 


Asp 


Gln Arg Phe Pro Ser


Glu Ser Lys 


Gln


Glu 


65 










70 


75 








80 


Glu 


Asp 


Glu 


Glu 


Tyr 


Ser 


Cys Asp Ser Arg Ser 


Leu Phe 


Glu 


Ser 


Ser 










85 




90 






95 




Ala 


Lys 


Ile


Gln


Val 


Cys 


Ile Pro Glu Ser Ile


Tyr Gln Lys


Val 


Met 






100 






105 




110 






Glu 


Ile


Asn 


Arg 


Glu 


Val 


Glu Glu Pro Pro Lys 


Lys Pro 


Ser 


Ala 


Phe 






115 








120 


125 








Lys 


Pro 


Ala 


Ile


Glu 


Met 


Gln Asn Ser Val Pro


Asn Lys 


Ala 


Phe 


Glu 


130 










135 


140 








Leu 


Lys 


Asn 


Glu 


Gln


Thr 


Leu Arg Ala Asp Pro


Met Phe 


Pro 


Pro 


Glu 


145 








150 


155 








160 


Ser 


Lys 


Gln


Lys 


Asp 


Tyr 


Glu Glu Asn Ser Trp 


Asp Ser 


Glu 


Ser 


Leu 










165 




170 






175 




Cys 


Glu 


Thr 


Val 


Ser 


Gln


Lys Asp Val Cys Leu 


Pro Lys 


Ala 


Thr 


His 






180 






185 




190 






Gln


Lys 


Glu 


Ile


Asp 


Lys 


Ile Asn Gly Lys Leu Glu Glu


Ser 


Pro 


Asn 






195 








200 


205 








Lys 


Asp 


Gly 


Leu 


Leu 


Lys 


Ala Thr Cys Gly Met Lys Val 


Ser 


Ile


Pro 


210 










215 


220 








Thr 


Lys 


Ala 


Leu 


Glu 


Leu 


Lys Asp Met Gln Thr Phe Lys


Ala 


Glu 


Pro 


225 










230 


235 








240 
Pro


Gly 


Lys 


Pro 


Ser 


Ala Phe Glu Pro Ala Thr Glu Met 


Gln


Lys Ser 










245 


250 




255 


Val 


Pro 


Asn Lys 


Ala 


Leu Glu Leu Lys Asn Glu Gln Thr


Leu 


Arg Ala 








260 




265 


270 


Asp 


Glu 


Ile


Leu 


Pro 


Ser Glu Ser Lys Gln Lys Asp Tyr


Glu 


Glu Asn 






275 






280 285 






Ser 


Trp 


Asp 


Thr 


Glu 


Ser Leu Cys Glu Thr Val Ser Gln


Lys 


Asp Val 




290 








295 300 




Cys 


Leu 


Pro Lys 


Ala 


Ala His Gln Lys Glu Ile Asp Lys


Ile Asn Gly


305 










310 315 




320 


Lys 


Leu 


Glu Gly Ser 


Pro Gly Lys Xaa Gly Leu Leu Lys 


Ala 


Asn Cys 










325 


330 




335 


Gly 


Met 


Lys 


Val 


Ser 


Ile Pro Thr Lys Ala Leu Glu Leu


Met Asp Met 








340 




345 


350 




Gln


Thr 


Phe 


Lys 


Ala 


Glu Pro Pro Glu Lys Pro Ser Ala 


Phe 


Glu Pro 






355 






360 365 






Ala 


Ile


Glu 


Met 


Gln


Lys Ser Val Pro Asn Lys Ala Leu 


Glu 


Leu Lys 




370 








375 380 






Asn 


Glu 


Gln


Thr 


Leu 


Arg Ala Asp Glu Ile Leu Pro Ser


Glu 


Ser Lys 


385 










390 395 




400 


Gln


Lys 


Asp 


Tyr 


Glu 


Glu Ser Ser Trp Asp Ser Glu Ser 


Leu 


Cys Glu 










405 


410 




415 


Thr 


Val 


Ser 


Gln


Lys 


Asp Val Cys Leu Pro Lys Ala Ala 


His 


Gln Lys








420 




425 


430 




Glu 


Ile


Asp Lys 


Ile


Asn Gly Lys Leu Glu Gly Lys Asn 


Arg 


Phe Leu 






435 






440 445 






Phe 


Lys 


Asn 


His 


Leu 


Thr Lys Tyr Phe Ser Lys Leu Met 


Arg 


Lys Asp 




450 








455 460 






Ile


Leu 














465 

















<210> 473 
<211> 445 
<212> PRT 

<213> Homo sapiens 
<400> 473 



Lys 


Glu 


Ile


Asp 


Lys 


Ile Asn Gly


1 








5 




Asp 


Gly 


Leu 


Leu 


Lys 


Ala Asn Cys 








20 






Lys 


Ala 


Leu 


Glu 


Leu 


Met Asp Met 






35 






40 


Glu 


Lys 


Pro 


Ser 


Ala 


Phe Glu Pro 




50 








55 


Pro 


Asn 


Lys 


Ala 


Leu 


Glu Leu Lys 


65 










70 


Glu 


Ile


Leu 


Pro 


Ser 


Glu Ser Lys 










85 




Trp 


Asp 


Ser 


Glu 


Ser 


Leu Cys Glu 








100 






Leu 


Pro 


Lys 


Ala 


Ala 


His Gin Lys 






115 






120 


Leu 


Glu 


Glu 


Ser 


Pro 


Asp Asn Asp 




130 








135 


Met 


Lys 


Val 


Ser 


Ile


Pro Thr Lys 


145 










150 



Lys Leu Glu Gly Ser Pro Val Lys 

10 15 
Gly Met Lys Val Ser Ile Pro Thr
25 30 
Gln Thr Phe Lys Ala Glu Pro Pro
45 

Ala Ile Glu Met Gln Lys Ser Val
60 

Asn Glu Gln Thr Leu Arg Ala Asp

75 80 
Gln Lys Asp Tyr Glu Glu Ser Ser

90 95 
Thr Val Ser Gln Lys Asp Val Cys
105 110 
Glu Ile Asp Lys Ile Asn Gly Lys
125 

Gly Phe Leu Lys Ala Pro Cys Arg 
140 

Ala Leu Glu Leu Met Asp Met Gln
155 160 
140



Thr 


Phe 


Lys 


Ala 


Glu 
165 


Pro Pro 


Glu 


Lys 


Pro 
170 


Ser 


Ala 


Phe 


Glu 


Pro 
175 


Ala 


Ile


Glu 


Met 


Gln


Lys 


Ser Val 


Pro 


Asn 


Lys 


Ala 


Leu 


Glu 


Leu 


Lys 


Asn 








180 








185 










190 




Glu 


Gln


Thr 


Leu 


Arg Ala Asp 


Gln


Met 


Phe 


Pro 


Ser 


Glu 


Ser 


Lys 


Gln






195 








200 










205 






Lys 


Lys 
210 


Val 


Glu 


Glu 


Asn Ser 
215 


Trp 


Asp 


Ser 


Glu 


Ser 
220 


Leu 


Arg 


Glu 


Thr 


Val 


Ser 


Gln


Lys 


Asp Val Cys 


Val 


Pro 


Lys 


Ala 


Thr 


His 


Gln


Lys 


Glu 


225 










230 








235 








240 


Met 


Asp 


Lys 


Ile


Ser Gly Lys 


Leu 


Glu 


Asp 


Ser 


Thr 


Ser 


Leu 


Ser 


Lys 










245 








250 










255 




Ile


Leu 


Asp 


Thr 
260 


Val 


His Ser 


Cys 


Glu 
265 


Arg 


Ala 


Arg 


Glu 


Leu 
270 


Gln


Lys 


Asp 


His 


Cys 


Glu 


Gln Arg Thr


Gly 


Lys 


Met 


Glu 


Gln


Met 


Lys 


Lys 


Lys 






275 








280 










285 








Phe 


Cys 
290 


Val 


Leu 


Lys 


Lys Lys 
295 


Leu 


Ser 


Glu 


Ala 


Lys 
300 


Glu 


Ile


Lys 


Ser 


Gln


Leu 


Glu 


Asn 


Gln Lys Val


Lys 


Trp 


Glu 


Gln


Glu 


Leu 


Cys 


Ser 


Val 


305 










310 








315 










320 


Arg 


Leu 


Thr 


Leu 


Asn 
325 


Gln Glu


Glu 


Glu 


Lys 
330 


Arg 


Arg 


Asn 


Ala 


Asp 
335 


Ile


Leu 


Asn 


Glu 


Lys 


Ile Arg Glu


Glu 


Leu 


Gly 


Arg 


Ile


Glu 


Glu 


Gln


His 








340 








345 










350 






Arg 


Lys


Glu 
355 


Leu 


Glu 


Val Lys 


Gln
360 


Gln


Leu 


Glu 


Gln


Ala 
365 


Leu 


Arg 


Ile


Gln


Asp 
370 


Ile


Glu 


Leu 


Lys Ser 
375 


Val 


Glu 


Ser 


Asn 


Leu 
380 


Asn 


Gln


Val 


Ser 


His 


Thr 


His 


Glu 


Asn 


Glu Asn 


Tyr 


Leu 


Leu 


His 


Glu 


Asn 


Cys 


Met 


Leu 


385 










390 








395 










400 


Lys 


Lys 


Glu 


Ile


Ala 
405 


Met Leu 


Lys 


Leu 


Glu 
410 


Ile


Ala 


Thr 


Leu 


Lys 
415 


His 


Gln


Tyr 


Gln


Glu 
420 


Lys 


Glu Asn 


Lys 


Tyr 
425 


Phe 


Glu 


Asp 


Ile


Lys 
430 


Ile


Leu 


Lys 


Glu 


Lys 
435 


Asn 


Ala 


Glu Leu 


Gln
440 


Met 


Thr 


Pro 


Arg 


Ala 
445 









<210> 474 

<211> 3865 

<212> DNA 

<213> Homo sapiens 

<400> 474 

tccgagctga ttacagacac caaggaagat gctgtaaaga gtcagcagcc acagccctgg 60 
ctagctggcc ctgtgggcat ttattagtaa agttttaatg acaaaagctt tgagtcaaca 120 
cacccgtggg taattaacct ggtcatcccc accctggaga gccatcctgc ccatgggtga 180 
tcaaagaagg aacatctgca ggaacacctg atgaggctgc acccttggcg gaaagaacac 240 
ctgacacagc tgaaagcttg gtggaaaaaa cacctgatga ggctgcaccc ttggtggaaa 300 
gaacacctga cacggctgaa agcttggtgg aaaaaacacc tgatgaggct gcatccttgg 360 
tggagggaac atctgacaaa attcaatgtt tggagaaagc gacatctgga aagttcgaac 420 
agtcagcaga agaaacacct agggaaatta cgagtcctgc aaaagaaaca tctgagaaat 480 
ttacgtggcc agcaaaagga agacctagga agatcgcatg ggagaaaaaa gaagacacac 540 
ctagggaaat tatgagtccc gcaaaagaaa catctgagaa atttacgtgg gcagcaaaag 600 
gaagacctag gaagatcgca tgggagaaaa aagaaacacc tgtaaagact ggatgcgtgg 660 
caagagtaac atctaataaa actaaagttt tggaaaaagg aagatctaag atgattgcat 720 
gtcctacaaa agaatcatct acaaaagcaa gtgccaatga tcagaggttc ccatcagaat 780 
ccaaacaaga ggaagatgaa gaatattctt gtgattctcg gagtctcttt gagagttctg 840 
caaagattca agtgtgtata cctgagtcta tatatcaaaa agtaatggag ataaatagag 900
aagtagaaga gcctcctaag aagccatctg ccttcaagcc tgccattgaa atgcaaaact 960 
ctgttccaaa taaagccttt gaattgaaga atgaacaaac attgagagca gatccgatgt 1020 
tcccaccaga atccaaacaa aaggactatg aagaaaattc ttgggattct gagagtctct 1080 
gtgagactgt ttcacagaag gatgtgtgtt tacccaaggc tacacatcaa aaagaaatag 1140 
ataaaataaa tggaaaatta gaagagtctc ctaataaaga tggtcttctg aaggctacct 1200 
gcggaatgaa agtttctatt ccaactaaag ccttagaatt gaaggacatg caaactttca 1260 
aagcagagcc tccggggaag ccatctgcct tcgagcctgc cactgaaatg caaaagtctg 1320 
tcccaaataa agccttggaa ttgaaaaatg aacaaacatt gagagcagat gagatactcc 1380 
catcagaatc caaacaaaag gactatgaag aaagttcttg ggattctgag agtctctgtg 1440
agactgtttc acagaaggat gtgtgtttac ccaaggctrc rcatcaaaaa gaaatagata 1500 
aaataaatgg aaaattagaa gggtctcctg ttaaagatgg tcttctgaag gctaactgcg 1560 
gaatgaaagt ttctattcca actaaagcct tagaattgat ggacatgcaa actttcaaag 1620 
cagagcctcc cgagaagcca tctgccttcg agcctgccat tgaaatgcaa aagtctgttc 1680 
caaataaagc cttggaattg aagaatgaac aaacattgag agcagatgag atactcccat 1740 
cagaatccaa acaaaaggac tatgaagaaa gttcttggga ttctgagagt ctctgtgaga 1800 
ctgtttcaca gaaggatgtg tgtttaccca aggctrcrca tcaaaaagaa atagataaaa 1860 
taaatggaaa attagaagag tctcctgata atgatggttt tctgaaggct ccctgcagaa 1920 
tgaaagtttc tattccaact aaagccttag aattgatgga catgcaaact ttcaaagcag 1980 
agcctcccga gaagccatct gccttcgagc ctgccattga aatgcaaaag tctgttccaa 2040 
ataaagcctt ggaattgaag aatgaacaaa cattgagagc agatcagatg ttcccttcag 2100 
aatcaaaaca aaagaasgtt gaagaaaatt cttgggattc tgagagtctc cgtgagactg 2160 
tttcacagaa ggatgtgtgt gtacccaagg ctacacatca aaaagaaatg gataaaataa 2220 
gtggaaaatt agaagattca actagcctat caaaaatctt ggatacagtt cattcttgtg 2280 
aaagagcaag ggaacttcaa aaagatcact gtgaacaacg tacaggaaaa atggaacaaa 2340 
tgaaaaagaa gttttgtgta ctgaaaaaga aactgtcaga agcaaaagaa ataaaatcac 2400 
agttagagaa ccaaaaagtt aaatgggaac aagagctctg cagtgtgaga ttgactttaa 2460 
accaagaaga agagaagaga agaaatgccg atatattaaa tgaaaaaatt agggaagaat 2520 
taggaagaat cgaagagcag cataggaaag agttagaagt gaaacaacaa cttgaacagg 2580 
ctctcagaat acaagatata gaattgaaga gtgtagaaag taatttgaat caggtttctc 2640 
acactcatga aaatgaaaat tatctcttac atgaaaattg catgttgaaa aaggaaattg 2700 
ccatgctaaa actggaaata gccacactga aacaccaata ccaggaaaag gaaaataaat 2760 
actttgagga cattaagatt ttaaaagaaa agaatgctga acttcagatg accctaaaac 2820 
tgaaagagga atcattaact aaaagggcat ctcaatatag tgggcagctt aaagttctga 2880 
tagctgagaa cacaatgctc acttctaaat tgaaggaaaa acaagacaaa gaaatactag 2940 
aggcagaaat tgaatcacac catcctagac tggcttctgc tgtacaagac catgatcaaa 3000 
ttgtgacatc aagaaaaagt caagaacctg ctttccacat tgcaggagat gcttgtttgc 3060 
aaagaaaaat gaatgttgat gtgagtagta cgatatataa caatgaggtg ctccatcaac 3120 
cactttctga agctcaaagg aaatccaaaa gcctaaaaat taatctcaat tatgcmggag 3180 
atgctctaag agaaaataca ttggtttcag aacatgcaca aagagaccaa cgtgaaacac 3240 
agtgtcaaat gaaggaagct gaacacatgt atcaaaacga acaagataat gtgaacaaac 3300 
acactgaaca gcaggagtct ctagatcaga aattatttca actacaaagc aaaaatatgt 3360 
ggcttcaaca gcaattagtt catgcacata agaaagctga caacaaaagc aagataacaa 3420 
ttgatattca ttttcttgag aggaaaatgc aacatcatct cctaaaagag aaaaatgagg 3480 
agatatttaa ttacaataac catttaaaaa accgtatata tcaatatgaa aaagagaaag 3540 
cagaaacaga aaactcatga gagacaagca gtaagaaact tcttttggag aaacaacaga 3600 
ccagatcttt actcacaact catgctagga ggccagtcct agcatcacct tatgttgaaa 3660 
atcttaccaa tagtctgtgt caacagaata cttattttag aagaaaaatt catgatttct 3720 
tcctgaagcc tacagacata aaataacagt gtgaagaatt acttgttcac gaattgcata 3780 
aagctgcaca ggattcccat ctaccctgat gatgcagcag acatcattca atccaaccag 3840 
aatctcgctc tgtcactcag gctgg 3865 

<210> 475 

<211> 1002 

<212> PRT 

<213> Homo sapiens 



<220> 
<221> VARIANT

<222> 310, 429, 522 

<223> Xaa = Any Amino Acid 



<400> 475 



Met 


Ser 


Pro 


Ala Lys 


Glu 


Thr 


Ser Glu Lys 


Phe 


Thr 


Trp 


Ala 


Ala 


Lys 


1 






5 






10 










15 


Gly 


Arg 


Pro 


Arg Lys 


Ile


Ala 


Trp Glu Lys 


Lys 


Glu 


Thr 


Pro 


Val 


Lys 








20 






25 








30 






Thr 


Gly 


Cys 


Val Ala 


Arg 


Val 


Thr Ser Asn 


Lys 


Thr 


Lys 


Val 


Leu 


Glu 






35 








40 






45 








Lys 


Gly 


Arg 


Ser Lys 


Met 


Ile


Ala Cys Pro 


Thr 


Lys 


Glu 


Ser 


Ser 


Thr 




50 








55 






60 










Lys 


Ala 


Ser 


Ala Asn 


Asp 


Gln


Arg Phe Pro 


Ser 


Glu 


Ser 


Lys 


Gln


Glu 


65 








70 






75 










80 


Glu 


Asp 


Glu 


Glu Tyr 


Ser 


Cys 


Asp Ser Arg 


Ser 


Leu 


Phe 


Glu 


Ser 


Ser 








85 






90










95 




Ala 


Lys 


Ile


Gln Val


Cys 


Ile


Pro Glu Ser 


Ile


Tyr 


Gln


Lys 


Val 


Met 








100 






105 








110 






Glu 


Ile


Asn 


Arg Glu 


Val 


Glu 


Glu Pro Pro 


Lys 


Lys 


Pro 


Ser 


Ala 


Phe 






115 








120 






125 








Lys 


Pro 


Ala 


Ile Glu


Met 


Gln


Asn Ser Val 


Pro 


Asn 


Lys 


Ala 


Phe 


Glu 




130 








135 






140 










Leu 


Lys 


Asn 


Glu Gln


Thr 


Leu 


Arg Ala Asp 


Pro 


Met 


Phe 


Pro 


Pro 


Glu 


145 








150 






155 










160 


Ser 


Lys 


Gln


Lys Asp 


Tyr 


Glu 


Glu Asn Ser 


Trp 


Asp 


Ser 


Glu 


Ser 


Leu 








165 






170 










175 




Cys 


Glu 


Thr 


Val Ser 


Gln


Lys 


Asp Val Cys 


Leu 


Pro 


Lys 


Ala 


Thr 


His 








180 






185 








190 






Gln


Lys 


Glu 


Ile Asp


Lys 


Ile


Asn Gly Lys 


Leu 


Glu 


Glu 


Ser 


Pro 


Asn 






195 








200 






205 








Lys 


Asp 


Gly 


Leu Leu 


Lys 


Ala 


Thr Cys Gly 


Met 


Lys 


Val 


Ser 


Ile


Pro 




210 








215 






220 










Thr 


Lys 


Ala 


Leu Glu 


Leu 


Lys 


Asp Met Gln


Thr 


Phe 


Lys 


Ala 


Glu 


Pro 


225 








230 






235 










240 


Pro 


Gly 


Lys 


Pro Ser 


Ala 


Phe 


Glu Pro Ala 


Thr 


Glu 


Met 


Gln


Lys 


Ser 








245 






250 










255 




Val 


Pro 


Asn 


Lys Ala 


Leu 


Glu 


Leu Lys Asn 


Glu 


Gln


Thr 


Leu 


Arg 


Ala 








260 






265 








270 






Asp 


Glu 


Ile


Leu Pro 


Ser 


Glu 


Ser Lys Gln


Lys 


Asp 


Tyr 


Glu 


Glu 


Ser 






275 








280 






285 








Ser 


Trp 


Asp 


Ser Glu 


Ser 


Leu 


Cys Glu Thr 


Val 


Ser 


Gln


Lys 


Asp 


Val 




290 








295 






300 










Cys 


Leu 


Pro 


Lys Ala 


Xaa 


His 


Gln Lys Glu


Ile


Asp 


Lys 


Ile


Asn 


Gly 


305 








310 






315 










320 


Lys 


Leu 


Glu 


Gly Ser 


Pro 


Val 


Lys Asp Gly 


Leu 


Leu 


Lys 


Ala 


Asn 


Cys 








325 






330 










335 




Gly 


Met 


Lys 


Val Ser 


Ile


Pro 


Thr Lys Ala 


Leu 


Glu 


Leu 


Met 


Asp 


Met 








340 






345 








350 






Gln


Thr 


Phe 


Lys Ala 


Glu 


Pro 


Pro Glu Lys 


Pro 


Ser 


Ala 


Phe 


Glu 


Pro 






355 








360 






365 








Ala 


Ile


Glu 


Met Gln


Lys 


Ser 


Val Pro Asn 


Lys 


Ala 


Leu 


Glu 


Leu 


Lys 




370 








375 






380 










Asn 


Glu 


Gln


Thr Leu 


Arg 


Ala 


Asp Glu Ile


Leu 


Pro 


Ser 


Glu 


Ser 


Lys 


385 








390 






395 










400 


Gln


Lys 


Asp 


Tyr Glu 


Glu 


Ser 


Ser Trp Asp 


Ser 


Glu 


Ser 


Leu 


Cys 


Glu 








405 






410 










415 




Thr 


Val 


Ser 


Gln Lys


Asp 


Val 


Cys Leu Pro 


Lys 


Ala 


Xaa 


His 


Gln


Lys 
420




ft J 


Glu




Asp 


Lys 


Ile


Asn Gly Lys Leu
















Phe


Leu 


Lys Ala 


Pro Cys Arg Met












A^^ 


Ala


Leu 


Glu


Leu 


Met 


Asp Met Gln Thr












470




Pro 


Ser 


Ala 


Phe 


Glu Pro Ala Ile










485 




Asn


Lys 




Leu 


Glu 


Leu Lys Asn Glu








500 




505




Phe 


Pro 


Ser 


Glu 


Ser Lys Gln Lys






515






520


Asp 


Ser 


Glu


Ser 


Leu 


Arg Glu Thr Val




530 










Pro


Lys 


Ala


Thr 


His 


Gln Lys Glu Met


545 










550




Asp 


Ser 


Thr 


Ser 


Leu Ser Lys Ile










565 




Glu 


Arg 


Ala 


Arg 


Glu 


Leu Gln Lys Asp








580 




585


Lys 


Met 


Glu 


Gln


Met 


Lys Lys Lys Phe 






595 






600 


Ser 


Glu 


Ala 


Lys 


Glu 


Ile Lys Ser Gln




610 








615 


Trp 


Glu 


Gln


Glu 


Leu 


Cys Ser Val Arg 


625 










630


Glu 


Lys 


Arg 


Arg Asn 


Ala Asp Ile Leu










645 




Leu 


Gly 


Arg 


Ile


Glu 


Glu Gln His Arg








660 




665




Leu 


Glu


Gln


Ala 


Leu Arg Ile Gln






675








Glu 


Ser 


Asn 


Leu 


Asn 


Gln Val Ser His




690 








695


Leu 


Leu 


His 


Glu 


Asn 


Cys Met Leu Lys 


705 










710 


Leu 


Glu 


Ile


Ala 


Thr 


Leu Lys His Gln










725 




Tyr 


Phe


Glu 


Asp 


Ile


Lys Ile Leu Lys








740 




745 


Met 


Thr 


Leu 


Lys 


Leu 


Lys Glu Glu Ser 






755 






760 


Tyr 


Ser 


Gly 


Gln


Leu 


Lys Val Leu Ile




770 








775 


Ser 


Lys 


Leu 


Lys 


Glu 


Lys Gln Asp Lys


785 










790 


Glu 


Ser 


His 


His 


Pro 


Arg Leu Ala Ser 










805 




Ile


Val 


Thr 


Ser Arg 


Lys Ser Gln Glu








820 




825 


Asp 


Ala 


Cys 


Leu 


Gln


Arg Lys Met Asn 






835 






840 


Tyr 


Asn 


Asn 


Glu 


Val 


Leu His Gln Pro


850 








855 


Ser 


Lys 


Ser 


Leu 


Lys 


Ile Asn Leu Asn


865 










870 


Glu 


Asn 


Thr 


Leu 


Val 


Ser Glu His Ala 
430



Glu 


Glu 


Ser 


Pro 


Asp 


Asn 


Asp 








445 








Lys


Val 



Ser 


Ile


Pro 


Thr


Lys 






460 








Phe


Lys 


Ala


Glu


Pro 


Pro 


Glu 




475 










480


Glu 


Met 


Gln


Lys 


Ser 


Val


Pro 


490 










495




Gln


Thr


Leu 


Arg 


Ala


Asp 


Gln










510






Xaa


Val


Glu


Glu


Asn 


Ser 


Trp 








525








Ser


Gln


Lys 


Asp 


Val


Cys 


Val
















Asp 


Lys 


Ile


Ser 


Gly 


Lys 


Leu 




555










560


Leu 


Asp 


Thr 


Val 


His 


Ser 


Cys 


570










575




His 


Cys 


Glu 


Gln


Arg 


Thr 


Gly 










590 






Cys 


Val 


Leu 


Lys 


Lys 


Lys 


Leu 








605








Leu 


Glu 


Asn 


Gin 


Lys 


Val 


Lys 
















Leu 


Thr 


Leu 


Asn 


Gin 


Glu 


Glu 




c 0 c 
0 03 










640 


Asn 


Glu 


Lys 


lie 


Arg 


Glu 


Glu 












655 




Lys 


(srlU 


Leu 


Glu 


Val 


Lys 


Gin 










670 






Asp 


He 


Glu 


Leu 


Lys 


Ser 


Val 








685 








Thr 


His 


Glu 


Asn 


Glu 


Asn 


Xyr 






"7 A A 
/UU 










Lys 


Glu 


He 


Ala 


Met 


Leu 


Lys 




/ l3 










720 


Tyr 


Gin 


Glu 


Lys 


Glu 


Asn 


Lys 


730 










735 




Glu 


Lys 


Asn 


Ala 


Glu 


Leu 


Gin 










750 






Leu 


Thr 


Lys 


Arg 


Ala 


Ser 


Gin 








765 








Ala 


Glu 


Asn 


Thr 


Met 


Leu 


Thr 






/ oU 










Glu 


He 


Leu 


Glu 


Ala 


Glu 


He 




795 










800 


Ala 


Val 


Gin 


Asp 


His 


Asp 


Gin 


810 










815 




Pro 


Ala 


Phe 


His 


He 


Ala 


Gly 










830 






Val 


Asp 


Val 


Ser 


Ser 


Thr 


He 








845 








Leu 


Ser 


Glu 


Ala 


Gin 


Arg 


Lys 






860 










Tyr 


Ala 


Gly 


Asp 


Ala 


Leu 


Arg 




875 










880 


Gin 


Arg 


Asp 


Gin 


Arg 


Glu 


Thr 
885 




890 










895 




Gln Cys Gln Met Lys Glu Ala Glu His Met Tyr Gln Asn Glu Gln Asp
900 905 910

Asn Val Asn Lys His Thr Glu Gln Gln Glu Ser Leu Asp Gln Lys Leu
915 920 925

Phe Gln Leu Gln Ser Lys Asn Met Trp Leu Gln Gln Gln Leu Val His
930 935 940

Ala His Lys Lys Ala Asp Asn Lys Ser Lys Ile Thr Ile Asp Ile His
945 950 955 960

Phe Leu Glu Arg Lys Met Gln His His Leu Leu Lys Glu Lys Asn Glu
965 970 975

Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile Tyr Gln Tyr
980 985 990

Glu Lys Glu Lys Ala Glu Thr Glu Asn Ser
995 1000



<210> 476 
<211> 356 
<212> DNA 

<213> Homo sapiens 



<400> 476 

aggtctgccg gaaatgttag gcaccccaac tcaagtccca ggccccaggc atctttcctg 60
ccctgccttg cttggcccat ccagtccagg cgcctggagc aagtgctcag ctacttctcc 120
tgcactttga aagacccctc ccactcctgg cctcacattt ctctgtgtga tcccccactt 180
ctgggctctg ccaccccaca gtgggaaagg ccaccctaga aagaagtccg ctggcaccca 240
taggaagggg cctcaggagc aggaagggcc aggaccagaa ccttgcccac ggcaactgcc 300
ttcctgcctc tccccttcct cctctgctct tgatctgtgt ttcaataaat taatgt 356



<210> 477 
<211> 1876 
<212> DNA 

<213> Homo sapiens 



<400> 477 

atgacctgcg gatcaggatt tggtgggcgc gccttcagct gcatctcggc ctgcgggccg 60
cgccccggcc gctgctgcat caccgccgcc ccctaccgtg gcatctcctg ctaccgcggc 120
ctcaccgggg gcttcggcag ccacagcgtg tgcggaggct ttcgggccgg ctcctgcgga 180
cgcagcttcg gctaccgctc cgggggcgtg tgcgggccca gtcccccatg catcaccacc 240
gtgtcggtca acgagagcct cctcacgccc ctcaacctgg agatcgaccc caacgcgcag 300
tgcgtgaagc aggaggagaa ggagcagatc aagtccctca acagcaggtt cgcggccttc 360
atcgacaagg tgcgcttcct ggagcagcag aacaaactgc tggagacaaa gctgcagttc 420
taccagaacc gcgagtgttg ccagagcaac ctggagcccc tgtttgaggg ctacatcgag 480
actctgcggc gggaggccga gtgcgtggag gccgacagcg ggaggctggc ctcagagctt 540
aaccacgtgc aggaggtgct ggagggctac aagaagaagt atgaggagga ggtttctctg 600
agagcaacag ctgagaacga gtttgtggct ctgaagaagg atgtggactg cgcctacctc 660
cgcaagtcag acctggaggc caacgtggag gccctgatcc aggagatcga cttcctgagg 720
cggctgtatg aggaggagat ccgcattctc cagtcgcaca tctcagacac ctccgtggtt 780
gtcaagctgg acaacagccg ggacctgaac atggactgca tcattgccga gattaaggca 840
cagtatgacg acattgtcac ccgcagccgg gccgaggccg agtcctggta ccgcagcaag 900
tgtgaggaga tgaaggccac ggtgatcagg cacggggaga ccctgcgccg caccaaggag 960
gagatcaatg agctgaaccg catgatccaa aggctgacgg ccgaggtgga gaatgccaag 1020
tgccagaact ccaagctgga ggccgcggtg gctcagtctg agcagcaggg tgaggcagcc 1080
ctcagtgatg cccgctgcaa gctggccgag ctggagggcg ccctgcagaa ggccaagcag 1140
gacatggcct gcctgatcag ggagtaccag gaggtgatga actccaagct gggcctggac 1200
atcgagatcg ccacctacag gcgcctgctg gagggcgagg agcagaggct atgtgaaggc 1260
attggggctg tgaatgtctg tgtcagcagc tcccggggcg gggtcgtgtg cggggacctc 1320
tgcgtgtcag gctcccggcc agtgactggc agtgtctgca gcgctccgtg caacgggaac 1380 

gtggcggtga gcaccggcct gtgtgcgccc tgcggccaat tgaacaccac ctgcggaggg 1440 

ggttcctgcg gcgtgggctc ctgtggtatc agctccctgg gtgtggggtc ttgcggcagc 1500 

agctgccgga aatgttaggc accccaactc aagtcccagg ccccaggcat ctttcctgcc 1560 

ctgccttgct tggcccatcc agtccaggcg cctggagcaa gtgctcagct acttctcctg 1620 

cactttgaaa gacccctccc actcctggcc tcacatttct ctgtgtgatc ccccacttct 1680 

gggctctgcc accccacagt gggaaaggcc accctagaaa gaagtccgct ggcacccata 1740 

ggaaggggcc tcaggagcag gaagggccag gaccagaacc ttgcccacgg caactgcctt 1800 

cctgcctctc cccttcctcc tctgctcttg atctgtgttt caataaatta atgtagccaa 1860 

aaaaaaaaaa aaaaaa 1876 

<210> 478 
<211> 505 
<212> PRT 

<213> Homo sapiens 
<400> 478 

Met Thr Cys Gly Ser Gly Phe Gly Gly Arg Ala Phe Ser Cys Ile Ser
1 5 10 15

Ala Cys Gly Pro Arg Pro Gly Arg Cys Cys Ile Thr Ala Ala Pro Tyr
20 25 30

Arg Gly Ile Ser Cys Tyr Arg Gly Leu Thr Gly Gly Phe Gly Ser His
35 40 45

Ser Val Cys Gly Gly Phe Arg Ala Gly Ser Cys Gly Arg Ser Phe Gly
50 55 60

Tyr Arg Ser Gly Gly Val Cys Gly Pro Ser Pro Pro Cys Ile Thr Thr
65 70 75 80

Val Ser Val Asn Glu Ser Leu Leu Thr Pro Leu Asn Leu Glu Ile Asp
85 90 95

Pro Asn Ala Gln Cys Val Lys Gln Glu Glu Lys Glu Gln Ile Lys Ser
100 105 110

Leu Asn Ser Arg Phe Ala Ala Phe Ile Asp Lys Val Arg Phe Leu Glu
115 120 125

Gln Gln Asn Lys Leu Leu Glu Thr Lys Leu Gln Phe Tyr Gln Asn Arg
130 135 140

Glu Cys Cys Gln Ser Asn Leu Glu Pro Leu Phe Glu Gly Tyr Ile Glu
145 150 155 160

Thr Leu Arg Arg Glu Ala Glu Cys Val Glu Ala Asp Ser Gly Arg Leu
165 170 175

Ala Ser Glu Leu Asn His Val Gln Glu Val Leu Glu Gly Tyr Lys Lys
180 185 190

Lys Tyr Glu Glu Glu Val Ser Leu Arg Ala Thr Ala Glu Asn Glu Phe
195 200 205

Val Ala Leu Lys Lys Asp Val Asp Cys Ala Tyr Leu Arg Lys Ser Asp
210 215 220

Leu Glu Ala Asn Val Glu Ala Leu Ile Gln Glu Ile Asp Phe Leu Arg
225 230 235 240

Arg Leu Tyr Glu Glu Glu Ile Arg Ile Leu Gln Ser His Ile Ser Asp
245 250 255

Thr Ser Val Val Val Lys Leu Asp Asn Ser Arg Asp Leu Asn Met Asp
260 265 270

Cys Ile Ile Ala Glu Ile Lys Ala Gln Tyr Asp Asp Ile Val Thr Arg
275 280 285

Ser Arg Ala Glu Ala Glu Ser Trp Tyr Arg Ser Lys Cys Glu Glu Met
290 295 300

Lys Ala Thr Val Ile Arg His Gly Glu Thr Leu Arg Arg Thr Lys Glu
305 310 315 320

Glu Ile Asn Glu Leu Asn Arg Met Ile Gln Arg Leu Thr Ala Glu Val
325 330 335

Glu Asn Ala Lys Cys Gln Asn Ser Lys Leu Glu Ala Ala Val Ala Gln
340 345 350

Ser Glu Gln Gln Gly Glu Ala Ala Leu Ser Asp Ala Arg Cys Lys Leu
355 360 365

Ala Glu Leu Glu Gly Ala Leu Gln Lys Ala Lys Gln Asp Met Ala Cys
370 375 380

Leu Ile Arg Glu Tyr Gln Glu Val Met Asn Ser Lys Leu Gly Leu Asp
385 390 395 400

Ile Glu Ile Ala Thr Tyr Arg Arg Leu Leu Glu Gly Glu Glu Gln Arg
405 410 415

Leu Cys Glu Gly Ile Gly Ala Val Asn Val Cys Val Ser Ser Ser Arg
420 425 430

Gly Gly Val Val Cys Gly Asp Leu Cys Val Ser Gly Ser Arg Pro Val
435 440 445

Thr Gly Ser Val Cys Ser Ala Pro Cys Asn Gly Asn Val Ala Val Ser
450 455 460

Thr Gly Leu Cys Ala Pro Cys Gly Gln Leu Asn Thr Thr Cys Gly Gly
465 470 475 480

Gly Ser Cys Gly Val Gly Ser Cys Gly Ile Ser Ser Leu Gly Val Gly
485 490 495

Ser Cys Gly Ser Ser Cys Arg Lys Cys
500 505



<210> 479 
<211> 221 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 22 

<223> n = A,T,C or G 
<400> 479 

ggtccattcc tttcctcgcg tnggggtttc tctgtgtcag cgagcctcgg tacactgatt 60 
tccgatcaaa agaatcatca tctttacctt gacttttcag ggaattactg aactttcttc 120 
tcagaagata gggcacagcc attgccttgg cctcacttga agggtctgca tttgggtcct 180 
ctggtctctt gccaagtttc ccagccactc gagggagaaa t 221 

<210> 480 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 480 

cggcgaattc accatgggaa caagagctct gcagtg 36 

<210> 481 
<211> 62 
<212> DNA 

<213> Artificial Sequence 



<220> 
<223> PCR primer 
<400> 481 

cggcaagctt ttaatggtga tggtgatgat gtataacttc tgtttctgct ttctcttttt 60 
ca 62 

<210> 482 
<211> 972 
<212> DNA 
<213> Homo sapiens 

<400> 482 

atgggaacaa gagctctgca gtgtgaggtt tctcacactc atgaaaatga aaattatctc 60 
ttacatgaaa attgcatgtt gaaaaaggaa attgccatgc taaaactgga aatagccaca 120 
ctgaaacacc aataccagga aaaggaaaat aaatactttg aggacattaa gattttaaaa 180 
gaaaagaatg ctgaacttca gatgacccta aaactgaaag aggaatcatt aactaaaagg 240 
gcatctcaat atagtgggca gcttaaagtt ctgatagctg agaacacaat gctcacttct 300 
aaattgaagg aaaaacaaga caaagaaata ctagaggcag aaattgaatc acaccatcct 360 
agactggctt ctgctgtaca agaccatgat caaattgtga catcaagaaa aagtcaagaa 420 
cctgctttcc acattgcagg agatgcttgt ttgcaaagaa aaatgaatgt tgatgtgagt 480 
agtacgatat ataacaatga ggtgctccat caaccacttt ctgaagctca aaggaaatcc 540 
aaaagcctaa aaattaatct caattatgcc ggagatgctc taagagaaaa tacattggtt 600 
tcagaacatg cacaaagaga ccaacgtgaa acacagtgtc aaatgaagga agctgaacac 660 
atgtatcaaa acgaacaaga taatgtgaac aaacacactg aacagcagga gtctctagat 720 
cagaaattat ttcaactaca aagcaaaaat atgtggcttc aacagcaatt agttcatgca 780 
cataagaaag ctgacaacaa aagcaagata acaattgata ttcattttct tgagaggaaa 840 
atgcaacatc atctcctaaa agagaaaaat gaggagatat ttaattacaa taaccattta 900 
aaaaaccgta tatatcaata tgaaaaagag aaagcagaaa cagaagttat acatcatcac 960 
catcaccatt aa 972 

<210> 483 
<211> 323 
<212> PRT 

<213> Homo sapiens 
<400> 483 

Met Gly Thr Arg Ala Leu Gln Cys Glu Val Ser His Thr His Glu Asn
1 5 10 15

Glu Asn Tyr Leu Leu His Glu Asn Cys Met Leu Lys Lys Glu Ile Ala
20 25 30

Met Leu Lys Leu Glu Ile Ala Thr Leu Lys His Gln Tyr Gln Glu Lys
35 40 45

Glu Asn Lys Tyr Phe Glu Asp Ile Lys Ile Leu Lys Glu Lys Asn Ala
50 55 60

Glu Leu Gln Met Thr Leu Lys Leu Lys Glu Glu Ser Leu Thr Lys Arg
65 70 75 80

Ala Ser Gln Tyr Ser Gly Gln Leu Lys Val Leu Ile Ala Glu Asn Thr
85 90 95

Met Leu Thr Ser Lys Leu Lys Glu Lys Gln Asp Lys Glu Ile Leu Glu
100 105 110

Ala Glu Ile Glu Ser His His Pro Arg Leu Ala Ser Ala Val Gln Asp
115 120 125

His Asp Gln Ile Val Thr Ser Arg Lys Ser Gln Glu Pro Ala Phe His
130 135 140

Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn Val Asp Val Ser
145 150 155 160

Ser Thr Ile Tyr Asn Asn Glu Val Leu His Gln Pro Leu Ser Glu Ala
165 170 175
Gln Arg Lys Ser Lys Ser Leu Lys Ile Asn Leu Asn Tyr Ala Gly Asp
180 185 190

Ala Leu Arg Glu Asn Thr Leu Val Ser Glu His Ala Gln Arg Asp Gln
195 200 205

Arg Glu Thr Gln Cys Gln Met Lys Glu Ala Glu His Met Tyr Gln Asn
210 215 220

Glu Gln Asp Asn Val Asn Lys His Thr Glu Gln Gln Glu Ser Leu Asp
225 230 235 240

Gln Lys Leu Phe Gln Leu Gln Ser Lys Asn Met Trp Leu Gln Gln Gln
245 250 255

Leu Val His Ala His Lys Lys Ala Asp Asn Lys Ser Lys Ile Thr Ile
260 265 270

Asp Ile His Phe Leu Glu Arg Lys Met Gln His His Leu Leu Lys Glu
275 280 285

Lys Asn Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile
290 295 300

Tyr Gln Tyr Glu Lys Glu Lys Ala Glu Thr Glu Val Ile His His His
305 310 315 320

His His His



<210> 484 

<211> 1518 

<212> DNA 

<213> Homo sapiens 

<400> 484 

atgacctgcg gatcaggatt tggtgggcgc gccttccgct gcatctcggc ctgcgggccg 60
cggcccggcc gctgctgcat caccgccgcc ccctaccgtg gcatctcctg ctaccgcggc 120
ctcaccgggg gcttcggcag ccacagcgtg tgcggaggct ttcgggccgg ctcctgcgga 180
cgcagcttcg gctaccgctc cgggggcgtg tgcgggccca gtcccccatg catcaccacc 240
gtgtcggtca acgagagcct cctcacgccc ctcaacctgg agatcgaccc caacgcgcag 300
tgcgtgaagc aggaggagaa ggagcagatc aagtccctca acagcaggtt cgcggccttc 360
atcgacaagg tgcgcttcct ggagcagcag aacaaactgc tggagacaaa gctgcagttc 420
taccagaacc gcgagtgttg ccagagcaac ctggagcccc tgtttgaggg ctacatcgag 480
actctgcggc gggaggccga gtgcgtggag gccgacagcg ggaggctggc ctcagagctt 540
aaccacgtgc aggaggtgct ggagggctac aagaagaagt atgaggagga ggtttctctg 600
agagcaacag ctgagaacga gtttgtggct ctgaagaagg atgtggactg cgcctacctc 660
cgcaagtcag acctggaggc caacgtggag gccctgatcc aggagatcga cttcctgagg 720
cggctgtatg aggaggagat ccgcattctc cagtcgcaca tctcagacac ctccgtggtt 780
gtcaagctgg acaacagccg ggacctgaac atggactgca tcattgccga gattaaggca 840
cagtatgacg acattgtcac ccgcagccgg gccgaggccg agtcctggta ccgcagcaag 900
tgtgaggaga tgaaggccac ggtgatcagg cacggggaga ccctgcgccg caccaaggag 960
gagatcaatg agctgaaccg catgatccaa aggctgacgg ccgaggtgga gaatgccaag 1020
tgccagaact ccaagctgga ggccgcggtg gcccagtctg agcagcaggg tgaggcagcc 1080
ctcagtgatg cccgctgcaa gctggccgag ctggagggcg ccctgcagaa ggccaagcag 1140
gacatggcct gcctgatcag ggagtaccag gaggtgatga actccaagct gggcctggac 1200
atcgagatcg ccacctacag gcgcctgctg gagggcgagg agcagaggct atgtgaaggc 1260
attggggctg tgaatgtctg tgtcagcagc tcccggggcg gggtcgtgtg cggggacctc 1320
tgcgtgtcag gctcccggcc agtgactggc agtgtctgca gcgctccgtg caacgggaac 1380
gtggcggtga gcaccggcct gtgtgcgccc tgcggccaat tgaacaccac ctgcggaggg 1440
ggttcctgcg gcgtgggctc ctgtggtatc agctccctgg gtgtggggtc ttgcggcagc 1500
agctgccgga aatgttag 1518



<210> 485 
<211> 505 
<212> PRT 
<213> Homo sapiens 
<400> 485 

Met Thr Cys Gly Ser Gly Phe Gly Gly Arg Ala Phe Arg Cys Ile Ser
1 5 10 15

Ala Cys Gly Pro Arg Pro Gly Arg Cys Cys Ile Thr Ala Ala Pro Tyr
20 25 30

Arg Gly Ile Ser Cys Tyr Arg Gly Leu Thr Gly Gly Phe Gly Ser His
35 40 45

Ser Val Cys Gly Gly Phe Arg Ala Gly Ser Cys Gly Arg Ser Phe Gly
50 55 60

Tyr Arg Ser Gly Gly Val Cys Gly Pro Ser Pro Pro Cys Ile Thr Thr
65 70 75 80

Val Ser Val Asn Glu Ser Leu Leu Thr Pro Leu Asn Leu Glu Ile Asp
85 90 95

Pro Asn Ala Gln Cys Val Lys Gln Glu Glu Lys Glu Gln Ile Lys Ser
100 105 110

Leu Asn Ser Arg Phe Ala Ala Phe Ile Asp Lys Val Arg Phe Leu Glu
115 120 125

Gln Gln Asn Lys Leu Leu Glu Thr Lys Leu Gln Phe Tyr Gln Asn Arg
130 135 140

Glu Cys Cys Gln Ser Asn Leu Glu Pro Leu Phe Glu Gly Tyr Ile Glu
145 150 155 160

Thr Leu Arg Arg Glu Ala Glu Cys Val Glu Ala Asp Ser Gly Arg Leu
165 170 175

Ala Ser Glu Leu Asn His Val Gln Glu Val Leu Glu Gly Tyr Lys Lys
180 185 190

Lys Tyr Glu Glu Glu Val Ser Leu Arg Ala Thr Ala Glu Asn Glu Phe
195 200 205

Val Ala Leu Lys Lys Asp Val Asp Cys Ala Tyr Leu Arg Lys Ser Asp
210 215 220

Leu Glu Ala Asn Val Glu Ala Leu Ile Gln Glu Ile Asp Phe Leu Arg
225 230 235 240

Arg Leu Tyr Glu Glu Glu Ile Arg Ile Leu Gln Ser His Ile Ser Asp
245 250 255

Thr Ser Val Val Val Lys Leu Asp Asn Ser Arg Asp Leu Asn Met Asp
260 265 270

Cys Ile Ile Ala Glu Ile Lys Ala Gln Tyr Asp Asp Ile Val Thr Arg
275 280 285

Ser Arg Ala Glu Ala Glu Ser Trp Tyr Arg Ser Lys Cys Glu Glu Met
290 295 300

Lys Ala Thr Val Ile Arg His Gly Glu Thr Leu Arg Arg Thr Lys Glu
305 310 315 320

Glu Ile Asn Glu Leu Asn Arg Met Ile Gln Arg Leu Thr Ala Glu Val
325 330 335

Glu Asn Ala Lys Cys Gln Asn Ser Lys Leu Glu Ala Ala Val Ala Gln
340 345 350

Ser Glu Gln Gln Gly Glu Ala Ala Leu Ser Asp Ala Arg Cys Lys Leu
355 360 365

Ala Glu Leu Glu Gly Ala Leu Gln Lys Ala Lys Gln Asp Met Ala Cys
370 375 380

Leu Ile Arg Glu Tyr Gln Glu Val Met Asn Ser Lys Leu Gly Leu Asp
385 390 395 400

Ile Glu Ile Ala Thr Tyr Arg Arg Leu Leu Glu Gly Glu Glu Gln Arg
405 410 415

Leu Cys Glu Gly Ile Gly Ala Val Asn Val Cys Val Ser Ser Ser Arg
420 425 430

Gly Gly Val Val Cys Gly Asp Leu Cys Val Ser Gly Ser Arg Pro Val
435 440 445

Thr Gly Ser Val Cys Ser Ala Pro Cys Asn Gly Asn Val Ala Val Ser
450 455 460

Thr Gly Leu Cys Ala Pro Cys Gly Gln Leu Asn Thr Thr Cys Gly Gly
465 470 475 480

Gly Ser Cys Gly Val Gly Ser Cys Gly Ile Ser Ser Leu Gly Val Gly
485 490 495

Ser Cys Gly Ser Ser Cys Arg Lys Cys
500 505



<210> 486 
<211> 827 
<212> DNA 
<213> Homo sapiens 

<400> 486 

gcattctcca gtcgcacatc tcagacacct ccgtggttgt caagctggac aacagccggg 60
acctgaacat ggactgcatc attgccgaga ttaaggcaca gtatgacgac attgtcaccc 120
gcagccgggc cgaggccgag tcctggtacc gcagcaagtg tgaggagatg aaggccacgg 180
tgatcaggca cggggagacc ctgcgccgca ccaaggagga gatcaatgag ctgaaccgca 240
tgatccaaag gctgacggcc gaggtggaga atgccaagtg ccagaactcc aagctggagg 300
ccgcggtggc ccagtctgag cagcagggtg aggcagccct cagtgatgcc cgctgcaagc 360
tggccgagct ggagggcgcc ctgcagaagg ccaagcagga catggcctgc ctgatcaggg 420
agtaccagga ggtgatgaac tccaagctgg gcctggacat cgagatcgcc acctacaggc 480
gcctgctgga gggcgaggag cagaggctat gtgaaggcat tggggctgtg aatgtctgtg 540
tcagcagctc ccggggcggg gtcgtgtgcg gggacctctg cgtgtcaggc tcccggccag 600
tgactggcag tgtctgcagc gctccgtgca acgggaacgt ggcggtgagc accggcctgt 660
gtgcgccctg cggccaattg aacaccacct gcggaggggg ttcctgcggc gtgggctcct 720
gtggtatcag ctccctgggt gtggggtctt gcggcagcag ctgccggaaa tgttaggcac 780
cccaactcaa gtcccaggcc ccaggcatct ttcctgccct gccttgc 827

<210> 487
<211> 235
<212> PRT
<213> Homo sapiens

<400> 487






Met Asp Cys Ile Ile Ala Glu Ile Lys Ala Gln Tyr Asp Asp Ile Val
1 5 10 15

Thr Arg Ser Arg Ala Glu Ala Glu Ser Trp Tyr Arg Ser Lys Cys Glu
20 25 30

Glu Met Lys Ala Thr Val Ile Arg His Gly Glu Thr Leu Arg Arg Thr
35 40 45

Lys Glu Glu Ile Asn Glu Leu Asn Arg Met Ile Gln Arg Leu Thr Ala
50 55 60

Glu Val Glu Asn Ala Lys Cys Gln Asn Ser Lys Leu Glu Ala Ala Val
65 70 75 80

Ala Gln Ser Glu Gln Gln Gly Glu Ala Ala Leu Ser Asp Ala Arg Cys
85 90 95

Lys Leu Ala Glu Leu Glu Gly Ala Leu Gln Lys Ala Lys Gln Asp Met
100 105 110

Ala Cys Leu Ile Arg Glu Tyr Gln Glu Val Met Asn Ser Lys Leu Gly
115 120 125

Leu Asp Ile Glu Ile Ala Thr Tyr Arg Arg Leu Leu Glu Gly Glu Glu
130 135 140

Gln Arg Leu Cys Glu Gly Ile Gly Ala Val Asn Val Cys Val Ser Ser
145 150 155 160
Ser Arg Gly Gly Val Val Cys Gly Asp Leu Cys Val Ser Gly Ser Arg
165 170 175

Pro Val Thr Gly Ser Val Cys Ser Ala Pro Cys Asn Gly Asn Val Ala
180 185 190

Val Ser Thr Gly Leu Cys Ala Pro Cys Gly Gln Leu Asn Thr Thr Cys
195 200 205

Gly Gly Gly Ser Cys Gly Val Gly Ser Cys Gly Ile Ser Ser Leu Gly
210 215 220

Val Gly Ser Cys Gly Ser Ser Cys Arg Lys Cys
225 230 235



<210> 488 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 488 

Ser Leu Thr Lys Arg Ala Ser Gln Tyr 
1 5 



<210> 489 
<211> 27 
<212> DNA 

<213> Homo sapiens 
<400> 489 

tcattaacta aaagggcatc tcaatat 27

<210> 490 
<211> 3288 
<212> DNA 

<213> Homo sapiens 
<400> 490 

atgaagttgc tgatggtcct catgctggcg gccctctccc agcactgcta cgcaggctct 60
ggctgcccct tattggagaa tgtgatttcc aagacaatca atccacaagt gtctaagact 120
gaatacaaag aacttcttca agagttcata gacgacaatg ccactacaaa tgccatagat 180
gaattgaagg aatgttttct taaccaaacg gatgaaactc tgagcaatgt tgaggtgttt 240
atgcaattaa tatatgacag cagtctttgt gatttattta tgagtcccgc aaaagaaaca 300
tctgagaaat ttacgtgggc agcaaaagga agacctagga agatcgcatg ggagaaaaaa 360
gaaacacctg taaagactgg atgcgtggca agagtaacat ctaataaaac taaagttttg 420
gaaaaaggaa gatctaagat gattgcatgt cctacaaaag aatcatctac aaaagcaagt 480
gccaatgatc agaggttccc atcagaatcc aaacaagagg aagatgaaga atattcttgt 540
gattctcgga gtctctttga gagttctgca aagattcaag tgtgtatacc tgagtctata 600
tatcaaaaag taatggagat aaatagagaa gtagaagagc ctcctaagaa gccatctgcc 660
ttcaagcctg ccattgaaat gcaaaactct gttccaaata aagcctttga attgaagaat 720
gaacaaacat tgagagcaga tccgatgttc ccaccagaat ccaaacaaaa ggactatgaa 780
gaaaattctt gggattctga gagtctctgt gagactgttt cacagaagga tgtgtgttta 840
cccaaggcta cacatcaaaa agaaatagat aaaataaatg gaaaattaga agagtctcct 900
aataaagatg gtcttctgaa ggctacctgc ggaatgaaag tttctattcc aactaaagcc 960
ttagaattga aggacatgca aactttcaaa gcagagcctc cggggaagcc atctgccttc 1020
gagcctgcca ctgaaatgca aaagtctgtc ccaaataaag ccttggaatt gaaaaatgaa 1080
caaacattga gagcagatga gatactccca tcagaatcca aacaaaagga ctatgaagaa 1140
agttcttggg attctgagag tctctgtgag actgtttcac agaaggatgt gtgtttaccc 1200
aaggctrcrc atcaaaaaga aatagataaa ataaatggaa aattagaagg gtctcctgtt 1260
aaagatggtc ttctgaaggc taactgcgga atgaaagttt ctattccaac taaagcctta 1320
gaattgatgg acatgcaaac tttcaaagca gagcctcccg agaagccatc tgccttcgag 1380 
cctgccattg aaatgcaaaa gtctgttcca aataaagcct tggaattgaa gaatgaacaa 1440 
acattgagag cagatgagat actcccatca gaatccaaac aaaaggacta tgaagaaagt 1500 
tcttgggatt ctgagagtct ctgtgagact gtttcacaga aggatgtgtg tttacccaag 1560 
gctrcrcatc aaaaagaaat agataaaata aatggaaaat tagaagagtc tcctgataat 1620 
gatggttttc tgaaggctcc ctgcagaatg aaagtttcta ttccaactaa agccttagaa 1680 
ttgatggaca tgcaaacttt caaagcagag cctcccgaga agccatctgc cttcgagcct 1740 
gccattgaaa tgcaaaagtc tgttccaaat aaagccttgg aattgaagaa tgaacaaaca 1800 
ttgagagcag atcagatgtt cccttcagaa tcaaaacaaa agaasgttga agaaaattct 1860 
tgggattctg agagtctccg tgagactgtt tcacagaagg atgtgtgtgt acccaaggct 1920 
acacatcaaa aagaaatgga taaaataagt ggaaaattag aagattcaac tagcctatca 1980 
aaaatcttgg atacagttca ttcttgtgaa agagcaaggg aacttcaaaa agatcactgt 2040 
gaacaacgta caggaaaaat ggaacaaatg aaaaagaagt tttgtgtact gaaaaagaaa 2100 
ctgtcagaag caaaagaaat aaaatcacag ttagagaacc aaaaagttaa atgggaacaa 2160 
gagctctgca gtgtgagatt gactttaaac caagaagaag agaagagaag aaatgccgat 2220 
atattaaatg aaaaaattag ggaagaatta ggaagaatcg aagagcagca taggaaagag 2280 
ttagaagtga aacaacaact tgaacaggct ctcagaatac aagatataga attgaagagt 2340 
gtagaaagta atttgaatca ggtttctcac actcatgaaa atgaaaatta tctcttacat 2400 
gaaaattgca tgttgaaaaa ggaaattgcc atgctaaaac tggaaatagc cacactgaaa 2460 
caccaatacc aggaaaagga aaataaatac tttgaggaca ttaagatttt aaaagaaaag 2520 
aatgctgaac ttcagatgac cctaaaactg aaagaggaat cattaactaa aagggcatct 2580 
caatatagtg ggcagcttaa agttctgata gctgagaaca caatgctcac ttctaaattg 2640 
aaggaaaaac aagacaaaga aatactagag gcagaaattg aatcacacca tcctagactg 2700 
gcttctgctg tacaagacca tgatcaaatt gtgacatcaa gaaaaagtca agaacctgct 2760 
ttccacattg caggagatgc ttgtttgcaa agaaaaatga atgttgatgt gagtagtacg 2820 
atatataaca atgaggtgct ccatcaacca ctttctgaag ctcaaaggaa atccaaaagc 2880 
ctaaaaatta atctcaatta tgcmggagat gctctaagag aaaatacatt ggtttcagaa 2940 
catgcacaaa gagaccaacg tgaaacacag tgtcaaatga aggaagctga acacatgtat 3000 
caaaacgaac aagataatgt gaacaaacac actgaacagc aggagtctct agatcagaaa 3060 
ttatttcaac tacaaagcaa aaatatgtgg cttcaacagc aattagttca tgcacataag 3120 
aaagctgaca acaaaagcaa gataacaatt gatattcatt ttcttgagag gaaaatgcaa 3180 
catcatctcc taaaagagaa aaatgaggag atatttaatt acaataacca tttaaaaaac 3240 
cgtatatatc aatatgaaaa agagaaagca gaaacagaaa actcatga 3288 

<210> 491 
<211> 2232 
<212> DNA 

<213> Homo sapiens 
<400> 491 

atgaagttgc tgatggtcct catgctggcg gccctctccc agcactgcta cgcaggctct 60 
ggctgcccct tattggagaa tgtgatttcc aagacaatca atccacaagt gtctaagact 120 
gaatacaaag aacttcttca agagttcata gacgacaatg ccactacaaa tgccatagat 180 
gaattgaagg aatgttttct taaccaaacg gatgaaactc tgagcaatgt tgaggtgttt 240 
atgcaattaa tatatgacag cagtctttgt gatttattta tgagtcccgc aaaagaaaca 300 
tctgagaaat ttacgtgggc agcaaaagga agacctagga agatcgcatg ggagaaaaaa 360 
gaaacacctg taaagactgg atgcgtggca agagtaacat ctaataaaac taaagttttg 420 
gaaaaaggaa gatctaagat gattgcatgt cctacaaaag aatcatctac aaaagcaagt 480 
gccaatgatc agaggttccc atcagaatcc aaacaagagg aagatgaaga atattcttgt 540 
gattctcgga gtctctttga gagttctgca aagattcaag tgtgtatacc tgagtctata 600 
tatcaaaaag taatggagat aaatagagaa gtagaagagc ctcctaagaa gccatctgcc 660 
ttcaagcctg ccattgaaat gcaaaactct gttccaaata aagcctttga attgaagaat 720 
gaacaaacat tgagagcaga tccgatgttc ccaccagaat ccaaacaaaa ggactatgaa 780 
gaaaattctt gggattctga gagtctctgt gagactgttt cacagaagga tgtgtgttta 840 
cccaaggcta cacatcaaaa agaaatagat aaaataaatg gaaaattaga agagtctcct 900 
aataaagatg gtcttctgaa ggctacctgc ggaatgaaag tttctattcc aactaaagcc 960 
ttagaattga aggacatgca aactttcaaa gcagagcctc cggggaagcc atctgccttc 1020 
gagcctgcca ctgaaatgca aaagtctgtc ccaaataaag ccttggaatt gaaaaatgaa 1080 
caaacattga gagcagatga gatactccca tcagaatcca aacaaaagga ctatgaagaa 1140 
agttcttggg attctgagag tctctgtgag actgtttcac agaaggatgt gtgtttaccc 1200 
aaggctrcrc atcaaaaaga aatagataaa ataaatggaa aattagaagg gtctcctgtt 1260 
aaagatggtc ttctgaaggc taactgcgga atgaaagttt ctattccaac taaagcctta 1320 
gaattgatgg acatgcaaac tttcaaagca gagcctcccg agaagccatc tgccttcgag 1380 
cctgccattg aaatgcaaaa gtctgttcca aataaagcct tggaattgaa gaatgaacaa 1440 
acattgagag cagatgagat actcccatca gaatccaaac aaaaggacta tgaagaaagt 1500 
tcttgggatt ctgagagtct ctgtgagact gtttcacaga aggatgtgtg tttacccaag 1560 
gctrcrcatc aaaaagaaat agataaaata aatggaaaat tagaagagtc tcctgataat 1620 
gatggttttc tgaaggctcc ctgcagaatg aaagtttcta ttccaactaa agccttagaa 1680 
ttgatggaca tgcaaacttt caaagcagag cctcccgaga agccatctgc cttcgagcct 1740 
gccattgaaa tgcaaaagtc tgttccaaat aaagccttgg aattgaagaa tgaacaaaca 1800 
ttgagagcag atcagatgtt cccttcagaa tcaaaacaaa agaasgttga agaaaattct 1860 
tgggattctg agagtctccg tgagactgtt tcacagaagg atgtgtgtgt acccaaggct 1920 
acacatcaaa aagaaatgga taaaataagt ggaaaattag aagattcaac tagcctatca 1980 
aaaatcttgg atacagttca ttcttgtgaa agagcaaggg aacttcaaaa agatcactgt 2040 
gaacaacgta caggaaaaat ggaacaaatg aaaaagaagt tttgtgtact gaaaaagaaa 2100 
ctgtcagaag caaaagaaat aaaatcacag ttagagaacc aaaaagttaa atgggaacaa 2160 
gagctctgca gtgtgaggtt tctcacactc atgaaaatga aaattatctc ttacatgaaa 2220 
attgcatgtt ga 2232 

<210> 492 
<211> 1233 
<212> DNA 

<213> Homo sapiens 
<400> 492 

atgaagttgc tgatggtcct catgctggcg gccctctccc agcactgcta cgcaggctct 60 
ggctgcccct tattggagaa tgtgatttcc aagacaatca atccacaagt gtctaagact 120 
gaatacaaag aacttcttca agagttcata gacgacaatg ccactacaaa tgccatagat 180 
gaattgaagg aatgttttct taaccaaacg gatgaaactc tgagcaatgt tgaggtgttt 240 
atgcaattaa tatatgacag cagtctttgt gatttattta tgggaacaag agctctgcag 300 
tgtgaggttt ctcacactca tgaaaatgaa aattatctct tacatgaaaa ttgcatgttg 360 
aaaaaggaaa ttgccatgct aaaactggaa atagccacac tgaaacacca ataccaggaa 420 
aaggaaaata aatactttga ggacattaag attttaaaag aaaagaatgc tgaacttcag 480 
atgaccctaa aactgaaaga ggaatcatta actaaaaggg catctcaata tagtgggcag 540 
cttaaagttc tgatagctga gaacacaatg ctcacttcta aattgaagga aaaacaagac 600 
aaagaaatac tagaggcaga aattgaatca caccatccta gactggcttc tgctgtacaa 660 
gaccatgatc aaattgtgac atcaagaaaa agtcaagaac ctgctttcca cattgcagga 720 
gatgcttgtt tgcaaagaaa aatgaatgtt gatgtgagta gtacgatata taacaatgag 780 
gtgctccatc aaccactttc tgaagctcaa aggaaatcca aaagcctaaa aattaatctc 840 
aattatgccg gagatgctct aagagaaaat acattggttt cagaacatgc acaaagagac 900 
caacgtgaaa cacagtgtca aatgaaggaa gctgaacaca tgtatcaaaa cgaacaagat 960 
aatgtgaaca aacacactga acagcaggag tctctagatc agaaattatt tcaactacaa 1020 
agcaaaaata tgtggcttca acagcaatta gttcatgcac ataagaaagc tgacaacaaa 1080 
agcaagataa caattgatat tcattttctt gagaggaaaa tgcaacatca tctcctaaaa 1140 
gagaaaaatg aggagatatt taattacaat aaccatttaa aaaaccgtat atatcaatat 1200 
gaaaaagaga aagcagaaac agaagttata taa 1233 

<210> 493 
<211> 1095 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 

<222> 403, 522, 615 

<223> Xaa = Any Amino Acid 
<400> 493 








Met 


Lys 


Leu Leu Met 


Val 


Leu 


Met 


1 




5 








Tyr 


Ala 


Gly Ser Gly Cys 


Pro 


Leu 






20 








lie 


Asn 


Pro Gin Val 


Ser 


Lys 


Thr 






35 






40 


Phe 


lie 


Asp Asp Asn Ala Thr Thr 




50 






55 




Cvs 


Phe 


Leu Asn Gin Thr Asp Glu 


65 






70 






Met 


Gin 


Leu lie Tyr Asp 


Ser 


Ser 






85 








Ala 


Lys 


Glu Thr Ser 


Glu 


Lys 


Phe 






100 








Arg 


Lys 


lie Ala Trp 


Glu 


Lys 


Lys 






115 






120 


Val 


Ala 


Arg Val Thr 


Ser 


Asn 


Lys 




130 






135 




Ser 


Lys 


Met lie Ala 


Cys 


Pro 


Thr 


145 






150 






Ala 


Asn 


Asp Gin Arg 


Phe 


Pro 


Ser 






165 








Glu 


Tvr 


Ser Cys Asp 


Ser 


Arg 


Ser 






180 








Gin 


Val 


Cys lie Pro 


Glu 


Ser 


He 






195 






200 




Glu 


Val Glu Glu 


Pro 


Pro 


Lys 




210 






215 




lie 


Glu 


Met Gin Asn 


Ser 


Val 


Pro 


225 






230 






Glu 


Gin 


Thr Leu Arg Ala Asp Pro 






245 








Lys 


Asp 


Tyr Glu Glu 


Asn 


Ser 


Trp 






260 








Val 


Ser 


Gin Lys Asp Val 


Cys 


Leu 






275 






280 


lie 


Asp 


Lys lie Asn 


Gly Lys 


Leu 




290 






295 




Leu. 


Leu 


Lys Ala Thr 


Cys 


Gly Met 


305 






310 






Leu 


Glu 


Leu Lys Asp Met 


Gin 


Thr 






325 








Pro 


Ser 


Ala Phe Glu 


Pro 


Ala 


Thr 






340 








Lys 


Ala 


Leu Glu Leu 


Lys 


Asn 


Glu 






355 






360 


Leu 


Pro 


Ser Glu Ser 


Lys 


Gin 


Lys 




370 






375 




Ser 


Glu 


Ser Leu Cys 


Glu 


Thr 


Val 


385 






390 






Lys 


Ala 


Xaa His Gin 


Lys 


Glu 


He 






405 








Gly 


Ser 


Pro Val Lys Asp Gly Leu 






420 








Val 


Ser 


lie Pro Thr 


Lys Ala Leu 






435 






440 



Leu 


Ala 


Ala 


Leu 


Ser 


Gin 


His 


Cvs 




10 










15 




Leu 


Glu 


Asn 


Val 


He 


Ser 


Lvs 


Thr 


25 










30 






Glu Tyr 


Lvs 


Glu 


Leu 


Leu 


Gin 


Glu 










45 








Asn 


Ala 


He 


Asp 


Glu 


Leu 


Lvs 


Glu 








60 










Thr 


Leu 


Ser 


Asn 


Val 


Glu 


Val 


Phe 






75 










80 


Leu Cys 


Asp 


Leu 


Phe 


Met 


Ser 


Pro 




90 










95 




Thr Trp 


Ala 


Ala 


Lys 


Glv 


Arg 


Pro 












110 








1 III. 


Pro 


Val 


Lys 


Thr 


Glv 


Cys 










125 








JL ILL. 


Lys 


Val 


Leu 


Glu 


Lys 


Gly 


Arg 








140 










Lys 


Glu 


Ser 


Ser 


Thr 


Lys 


Ala 


Ser 






155 










160 


Glu 


Ser 


Lys 


Gin 


Glu 


Glu 


Asp 


Glu 




170 










175 




Leu 


Phe 


Glu 


Ser 


Ser 


Ala 


Lys 


He 


185 










190 






Tyr 


Gin 


Lvs 


Val 


Met 


Glu 


He 


Asn 










205 








Lys 


Pro 


Ser 


Ala 


Phe 


Lys 


Pro 


Ala 








220 










Asn 


Lys 


Ala 


Phe 


Glu 


Leu 


Lys 


Asn 






235 










240 


Met 


Phe 


Pro 


Pro 


Glu 


Ser 


Lys 


Gin 




250 










255 




Asp 


Ser 


Glu 


Ser 


Leu 


Cys 


Glu 


Thr 


265 










270 






Pro 


Lys 


Ala 


Thr 


His 


Gin 


Lys 


Glu 










285 








Glu 


Glu 


Ser 


Pro 


Asn 


Lys 


Asp 


Glv 








300 










Lys 


Val 


Ser 


He 


Pro 


Thr 


Lys 


Ala 






315 










320 


Phe 


Lys 


Ala 


Glu 


Pro 


Pro 


Gly 


Lys 




330 










335 




Glu 


Met 


Gin 


Lys 


Ser 


Val 


Pro 


Asn 


345 










350 






Gin 


Thr 


Leu 


Arg 


Ala 


Asp 


Glu 


He 










365 








Asp Tyr 


Glu 


Glu 


Ser 


Ser 


Trp 


Asp 








380 










Ser 


Gin 


Lys 


Asp 


Val 


Cys 


Leu 


Pro 






395 










400 


Asp Lys 


He 


Asn 


Gly 


Lys 


Leu 


Glu 




410 










415 




Leu Lys 


Ala 


Asn 


Cys 


Gly 


Met 


Lys 


425 










430 






Glu 


Leu 


Met 


Asp 


Met 


Gin 


Thr 


Phe 



445
Lys Ala


Glu 


Pro 


Pro 


Glu 


Lys 


Pro 


Ser 


Ala 


Phe 


Glu 


Pro 


Ala 


He 


Glu 


450 










455 










460 












Lys 


Ser 


Val 


Pro 


Asn 


Lys 


Ala 


Leu 


Glu 


Leu 


Lys 


Asn 


Glu 


Gin 


465 








470 










475 










480 


J. ill. XjC u. 




Ala 


Asp 


Glu 


He 


Leu 


jrx\j 


Ser 


Glu 


Ser 


Lys 


Gin 


Lys 


Asp 








485 










490 










495 




Tyr GXu 


Glu 


Ser 


Ser 


lip 


Asp 


Ser 


Glu 


Ser 


Leu 


Cys 


Glu 


Thr 


Val 


Ser 






500 










505 










510 








Asp 


Val 


Cys 


Leu 


Pro 


Lys 


Ala 


Xaa 


His 


Gin 


Lys 


Glu 


He 


Asp 




SIS 










^ £ u 










525 








Ly s I Xs 


Asn. 


Glv 


Lys 


Leu 


Glu 


Glu 


Ser 


Pro 


Asp 


Asn 


Asp 


Glv 


Phe 


Leu 


530 










535 










540 












Pro 


Cys 


Arg 


Met 


Lys 


Val 


Ser 


He 


Pro 


Thr 


Lys 


Ala 


Leu 


Glu 


545 








550 










555 










560 




Asp 


Met 


Gin 


Thr 




Lys 


Ala 


Glu 


Pro 


Pro 


Glu 


ys 


Pro 


Ser 








D 3 










S7n 










S7S 

0 / hi 




a flic 


Glu 


Pro 


Ala 


He 


Glu 


Met 


Gin 


Lys 


Ser 


Val 


Pro 


Asn 


Lys 


Ala 


























590 






T.^an fil n 


Leu 


Lys 


Asn 


Glu 


Gin 


Thr 


Leu 


Arg 


Ala 


sp 


Gin 


Met 


Phe 


Pro 
























DUO 








- 


Ser 


Lys 


Gin 


ys 


Xaa 


Val 


Glu 


Glu 


Asn 


Ser 






Ser 


Glu 


O X u 










o xo 










0 z u 












Ar^ 


m n 


Thr 


Val 


Ser 


Gin 


ys 


As 


Val 


Cys 


Val 


Pro 


Lys 


Ala 










n 




















Dl U 


^hjc Hxs 


Gin 


Lys 


Glu 


Met 


sp 


ys 


He 


Ser 


Gly 


ys 


Le 


Glu 


Asp 


Ser 








Ortiz 










SD 










ASS 






Leu 


OCX 


Lys 


Tl 


Leu 


Asp 


.L IIX 


Val 
VaX 


His 


ser 


Cys 


fii 11 

uXU 


Arg 


Al A 

AX a 






O DU 


























- 

ruT^ GxU 


Leu 




Lys 


Asp 


CIXS 


Cys 




r:l n 


Arg 


xnx 


vrxy 


Lys 




vrXU 




fi7S 




















0 00 








tZl n M*=i-t- 
urXn rlcU 


Lys 


Lys 


Lys 




Cys 


VaX 


Leu 


Lys 


Lys 


Lys 


Leu 


06X 


d n 
VaXU 


al 5* 

AX a. 


690 




















700 










■LJjr O O J. LI 


He 


Lys 


Ser 


Gin 


Leu 


Glu 


Asn 


Gin 


Lys 


Val 


ys 


xrp 


Glu 


Gin 


/ y 3 








71 fl 










7 1 ^ 












f^l n T.pan 


Cys 


Ser 


Val 


Arg 


Leu 


Thr 


Leu 


Asn 


Gin 


Glu 


Glu 


Glu 


Lys 


Arg 








77S 




















7 TS 






Ala 


Asp 


He 


Leu 


Asn 


Glu 


Lys 


He 


Arg 


vjX U. 


uxu 


Leu 


V»XJf 


Arg 






740 










745 










7sn 

/ 0 V/ 






Tl fZl n 


Glu 


Gin 


Hi c: 


Arg 


Lys 


n 

V7XU 


Leu 


m n 

V7XU 


Val 


Lys 


Cl n 
vxn 


uXtt 


Leu 


/ii 11 




7SS 










/ 0 u 










/DO 








vrXn aXoL 


Leu 


Arg 


i.Xc 




Asp 


XX6 


VaXU 


Leu 


Lys 


Ser 


Va.X 


uXU 


Ser 


Asn 


/ f \J 










77c; 










7 on 












vtJLil 


V ClX 




H1.S 


xnx 


• 

Hxs 


PI 11 
urXU 


Asn 


uXU 


Asn 


Tyr 


Leu 


Leu 


Hxs 


*7 O C 


















*7 a c 










0 n n 


Gill Asn. 


Cys 


Met 


Leu 


Lys 


Lys 


Glu 


He 


Ala 


Met 


Leu 


Lys 


Leu 


Glu 


He 








805 










810 










815 




Ala Thr Leu Lys His Gln Tyr Gln Glu Lys Glu Asn Lys Tyr Phe Glu

820 825 830

Asp Ile Lys Ile Leu Lys Glu Lys Asn Ala Glu Leu Gln Met Thr Leu

835 840 845

Lys Leu Lys Glu Glu Ser Leu Thr Lys Arg Ala Ser Gln Tyr Ser Gly

850 855 860

Gln Leu Lys Val Leu Ile Ala Glu Asn Thr Met Leu Thr Ser Lys Leu

865 870 875 880

Lys Glu Lys Gln Asp Lys Glu Ile Leu Glu Ala Glu Ile Glu Ser His

885 890 895

His Pro Arg Leu Ala Ser Ala Val Gln Asp His Asp Gln Ile Val Thr

900 905 910



Ser Arg Lys Ser Gln Glu Pro Ala Phe His Ile Ala Gly Asp Ala Cys

915 920 925

Leu Gln Arg Lys Met Asn Val Asp Val Ser Ser Thr Ile Tyr Asn Asn

930 935 940

Glu Val Leu His Gln Pro Leu Ser Glu Ala Gln Arg Lys Ser Lys Ser
945 950 955 960

Leu Lys Ile Asn Leu Asn Tyr Ala Gly Asp Ala Leu Arg Glu Asn Thr

965 970 975

Leu Val Ser Glu His Ala Gln Arg Asp Gln Arg Glu Thr Gln Cys Gln

980 985 990

Met Lys Glu Ala Glu His Met Tyr Gln Asn Glu Gln Asp Asn Val Asn

995 1000 1005

Lys His Thr Glu Gln Gln Glu Ser Leu Asp Gln Lys Leu Phe Gln Leu

1010 1015 1020

Gln Ser Lys Asn Met Trp Leu Gln Gln Gln Leu Val His Ala His Lys
1025 1030 1035 1040

Lys Ala Asp Asn Lys Ser Lys Ile Thr Ile Asp Ile His Phe Leu Glu

1045 1050 1055

Arg Lys Met Gln His His Leu Leu Lys Glu Lys Asn Glu Glu Ile Phe

1060 1065 1070

Asn Tyr Asn Asn His Leu Lys Asn Arg Ile Tyr Gln Tyr Glu Lys Glu

1075 1080 1085

Lys Ala Glu Thr Glu Asn Ser
1090 1095



<210> 494 
<211> 743 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> VARIANT 

<222> 403, 522, 615 

<223> Xaa = Any Amino Acid 



<400> 494 
















Met Lys Leu Leu Met Val Leu Met Leu Ala Ala Leu Ser Gln His Cys

1 5 10 15

Tyr Ala Gly Ser Gly Cys Pro Leu Leu Glu Asn Val Ile Ser Lys Thr

20 25 30

Ile Asn Pro Gln Val Ser Lys Thr Glu Tyr Lys Glu Leu Leu Gln Glu

35 40 45

Phe Ile Asp Asp Asn Ala Thr Thr Asn Ala Ile Asp Glu Leu Lys Glu

50 55 60

Cys Phe Leu Asn Gln Thr Asp Glu Thr Leu Ser Asn Val Glu Val Phe

65 70 75 80

Met Gln Leu Ile Tyr Asp Ser Ser Leu Cys Asp Leu Phe Met Ser Pro

85 90 95

Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys Gly Arg Pro

100 105 110

Arg Lys Ile Ala Trp Glu Lys Lys Glu Thr Pro Val Lys Thr Gly Cys

115 120 125

Val Ala Arg Val Thr Ser Asn Lys Thr Lys Val Leu Glu Lys Gly Arg

130 135 140

Ser Lys Met Ile Ala Cys Pro Thr Lys Glu Ser Ser Thr Lys Ala Ser

145 150 155 160

Ala Asn Asp Gln Arg Phe Pro Ser Glu Ser Lys Gln Glu Glu Asp Glu

165 170 175

Glu Tyr Ser Cys Asp Ser Arg Ser Leu Phe Glu Ser Ser Ala Lys Ile

180 185 190

Gln Val Cys Ile Pro Glu Ser Ile Tyr Gln Lys Val Met Glu Ile Asn

195 200 205

Arg Glu Val Glu Glu Pro Pro Lys Lys Pro Ser Ala Phe Lys Pro Ala

210 215 220

Ile Glu Met Gln Asn Ser Val Pro Asn Lys Ala Phe Glu Leu Lys Asn
225 230 235 240

Glu Gln Thr Leu Arg Ala Asp Pro Met Phe Pro Pro Glu Ser Lys Gln

245 250 255

Lys Asp Tyr Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr

260 265 270

Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala Thr His Gln Lys Glu

275 280 285

Ile Asp Lys Ile Asn Gly Lys Leu Glu Glu Ser Pro Asn Lys Asp Gly

290 295 300

Leu Leu Lys Ala Thr Cys Gly Met Lys Val Ser Ile Pro Thr Lys Ala
305 310 315 320

Leu Glu Leu Lys Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Gly Lys

325 330 335

Pro Ser Ala Phe Glu Pro Ala Thr Glu Met Gln Lys Ser Val Pro Asn

340 345 350

Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Glu Ile

355 360 365

Leu Pro Ser Glu Ser Lys Gln Lys Asp Tyr Glu Glu Ser Ser Trp Asp

370 375 380

Ser Glu Ser Leu Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro
385 390 395 400

Lys Ala Xaa His Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu

405 410 415

Gly Ser Pro Val Lys Asp Gly Leu Leu Lys Ala Asn Cys Gly Met Lys

420 425 430

Val Ser Ile Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gln Thr Phe

435 440 445

Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala Ile Glu

450 455 460

Met Gln Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln
465 470 475 480

Thr Leu Arg Ala Asp Glu Ile Leu Pro Ser Glu Ser Lys Gln Lys Asp

485 490 495

Tyr Glu Glu Ser Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr Val Ser

500 505 510

Gln Lys Asp Val Cys Leu Pro Lys Ala Xaa His Gln Lys Glu Ile Asp

515 520 525

Lys Ile Asn Gly Lys Leu Glu Glu Ser Pro Asp Asn Asp Gly Phe Leu

530 535 540

Lys Ala Pro Cys Arg Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu
545 550 555 560

Leu Met Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser

565 570 575

Ala Phe Glu Pro Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala

580 585 590

Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Gln Met Phe Pro

595 600 605

Ser Glu Ser Lys Gln Lys Xaa Val Glu Glu Asn Ser Trp Asp Ser Glu

610 615 620

Ser Leu Arg Glu Thr Val Ser Gln Lys Asp Val Cys Val Pro Lys Ala






625 630 



Thr His 


Gin 


Lys 


Glu 


Met 








645 




Thr Ser 


Leu 


Ser 


Lys 


He 






660 






Arg Glu 


Leu 


Gin Lys 


Asp 




675 








Gin Met 


Lys 


Lys 


Lys 


Phe 


690 










Lys Glu 


He 


Lys 


Ser 


Gin 


705 








710 


Glu Leu 


Cys 


Ser 


Val 


Arg 








725 




Ser Tyr 


Met 


Lys 


He 


Ala 



740 



<210> 495 
<211> 410 
<212> PRT 

<213> Homo sapiens 



<400> 495 



Met 


Lys 


Leu 


Leu 


Met 


Val 


1 








5 




Tyr 


Ala 


Gly 


Ser 


Gly 


Cys 








20 






He 


Asn 


Pro 


Gin 


Val 


Ser 






35 








Phe 


He 


Asp 


Asp 


Asn 


Ala 




50 










Cys 


Phe 


Leu 


Asn 


Gin 


Thr 


65 










70 


Met 


Gin 


Leu 


He 


Tyr 


Asp 










85 




Arg 


Ala 


Leu 


Gin 


Cys 


Glu 








100 






Leu 


Leu 


His 


Glu 


Asn 


Cys 






115 








Leu 


Glu 


He 


Ala 


Thr 


Leu 




130 










Tyr 


Phe 


Glu 


Asp 


He 


Lys 


145 










150 


Met 


Thr 


Leu 


Lys 


Leu 


Lys 










165 




Tyr 


Ser 


Gly 


Gin 


Leu 


Lys 








180 






Ser 


Lys 


Leu 


Lys 


Glu 


Lys 






195 








Glu 


Ser 


His 


His 


Pro 


Arg 




210 










He 


Val 


Thr 


Ser 


Arg 


Lys 


225 










230 


Asp 


Ala 


Cys 


Leu 


Gin 


Arg 










245 




Tyr 


Asn 


Asn 


Glu 


Val 


Leu 








260 






Ser 


Lys 


Ser 


Leu 


Lys 


He 























635 




Asp 


Lys 


He 


Ser 


Gly 


Lys 








650 






Leu 


Asp 


Thr 


Val 


His 


Ser 






665 








His 


Cys 


Glu 


Gin 


Arg 


Thr 




680 










Cys 


Val 


Leu 


Lys 


Lys 


Lys 


695 










700 


Leu 


Glu 


Asn 


Gin 


Lys 


Val 










715 




Phe 


Leu 


Thr 


Leu 


Met 


Lys 








730 






Cys 












Leu 


Met 


Leu 


Ala 


Ala 


Leu 








10 






Pro 


Leu 


Leu 


Glu 


Asn 


Val 






25 








Lvs 


Thr 


Glu 


Tvr 


Lvs 

2 


Glu 




40 










Thr 


Thr 


Asn 


Ala 


He 


Asp 


55 










60 


ASD 


Glu 


Thr 


Leu 


Ser 


Asn 










75 




Ser 


Ser 


Leu 


Cvs 


Asp 


Leu 








90 






Val 


Ser 


His 


Thr 


His 


Glu 






105 








Met 


Leu 


Lvs 


Lys 


Glu 


He 




120 










Lys 


His 


Gin 


Tvr 


Gin 


Glu 


135 










140 


He 


Leu 


Lvs 


Glu 


Lys 


Asn 










155 




Glu 


Glu 


Ser 


Leu 


Thr 


Lvs 








170 






Val 


Leu 


He 


Ala 


Glu 


Asn 






185 








Gin 


ASD 


Lvs 


Glu 


lie 


Leu 




200 










Leu 


Ala 


Ser 


Ala 


Val 


Gin 


215 










220 


Ser 


Gin 


Glu 


Pro 


Ala 


Phe 










235 




Lys 


Met 


Asn 


Val 


Asp 


Val 








250 






His 


Gin 


Pro 


Leu 


Ser 


Glu 






265 








Asn 


Leu 


Asn 


Tyr 


Ala 


Gly
640



Leu 


Glu 


Asp 


Ser 






655 




Cys 


iaJ-U 


Arg 






670 






Gly 


Lys 


Met 


Glu 


685 








Leu 


Ser 


Glu 


Ala 


Lys 


Trp 


Glu 


GXn 








720 


Met 


Lys 


iJLe 


ixe 






735 




Ser 


Gin 


His 


Cys 






15 




He 


Ser 


Lys 


Thr 




30 






Leu 


Leu 


Gin 


Glu 


45 








Glu 


Leu 


Lys 


Glu 


Val 


Glu 


Val 


Phe 










Pne 


Met 


Gly 


Thr 










Asn 


Glu 


Asn 


Tyr 




110 






Ala 


Met 


Leu 


Lys 


125 








Lys 


Glu 


Asn 


Lys 


Ala 


Glu 


Leu 


GXn 








160 


Arg 


Ai.a 


Ser 


oxn 






175 




Thr 


Met 


Leu 


Thr 




190 






Glu 


Ala 


Glu 


He 


205 








Asp 


His 


Asp 


Gin 


His 


He 


Ala 


Gly 








240 


Ser 


Ser 


Thr 


He 






255 




Ala 


Gin 


Arg 


Lys 




270 






Asp 


Ala 


Leu 


Arg


275 280 285

Glu Asn Thr Leu Val Ser Glu His Ala Gln Arg Asp Gln Arg Glu Thr

290 295 300

Gln Cys Gln Met Lys Glu Ala Glu His Met Tyr Gln Asn Glu Gln Asp

305 310 315 320

Asn Val Asn Lys His Thr Glu Gln Gln Glu Ser Leu Asp Gln Lys Leu

325 330 335

Phe Gln Leu Gln Ser Lys Asn Met Trp Leu Gln Gln Gln Leu Val His

340 345 350

Ala His Lys Lys Ala Asp Asn Lys Ser Lys Ile Thr Ile Asp Ile His

355 360 365

Phe Leu Glu Arg Lys Met Gln His His Leu Leu Lys Glu Lys Asn Glu

370 375 380

Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile Tyr Gln Tyr

385 390 395 400

Glu Lys Glu Lys Ala Glu Thr Glu Val Ile

405 410



<210> 496 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 496 

Ile Asp Glu Leu Lys Glu Cys Phe Leu Asn Gln Thr Asp Glu Thr Leu

1 5 10 15

Ser Asn Val Glu
20



<210> 497

<211> 15

<212> PRT

<213> Homo sapiens



<400> 497

Thr Thr Asn Ala Ile Asp Glu Leu Lys Glu Cys Phe Leu Asn Gln
1 5 10 15



<210> 498

<211> 21

<212> PRT

<213> Homo sapiens



<400> 498

Ser Gln His Cys Tyr Ala Gly Ser Gly Cys Pro Leu Leu Glu Asn Val

1 5 10 15

Ile Ser Lys Thr Ile
20



<210> 499

<211> 20

<212> PRT

<213> Homo sapiens
<400> 499

Glu Tyr Lys Glu Leu Leu Gln Glu Phe Ile Asp Asp Asn Ala Thr Thr

1 5 10 15

Asn Ala Ile Asp
20



<210> 500

<211> 9

<212> PRT

<213> Homo sapiens

<400> 500

Lys Leu Leu Met Val Leu Met Leu Ala
1 5



<210> 501
<211> 13
<212> PRT

<213> Homo sapiens
<400> 501

Gln Glu Phe Ile Asp Asp Asn Ala Thr Thr Asn Ala Ile
1 5 10



<210> 502
<211> 13
<212> PRT

<213> Homo sapiens
<400> 502

Leu Lys Glu Cys Phe Leu Asn Gln Thr Asp Glu Thr Leu
1 5 10



<210> 503 

<211> 93 

<212> PRT 

<213> Homo sapiens 

<400> 503 



Met Lys Leu Leu Met Val Leu Met Leu Ala Ala Leu Ser Gln His Cys

1 5 10 15

Tyr Ala Gly Ser Gly Cys Pro Leu Leu Glu Asn Val Ile Ser Lys Thr

20 25 30

Ile Asn Pro Gln Val Ser Lys Thr Glu Tyr Lys Glu Leu Leu Gln Glu

35 40 45

Phe Ile Asp Asp Asn Ala Thr Thr Asn Ala Ile Asp Glu Leu Lys Glu

50 55 60

Cys Phe Leu Asn Gln Thr Asp Glu Thr Leu Ser Asn Val Glu Val Phe

65 70 75 80

Met Gln Leu Ile Tyr Asp Ser Ser Leu Cys Asp Leu Phe

85 90



<210> 504
<211> 1964

<212> DNA 

<213> Homo sapiens 



<400> 504 

gcatgctcga cgccccatgt gctgaaaggg cgaggagcct cctgcggcgg cccctgtgtc 60
cctgcctcta cctgcgcacc tgcatgtgtt caacccccgg gagaacacct ggcggcccct 120
gacccaggtg cccgaggagg ccccgcttcg gggctgcggt ctctgcacca tgcacaacta 180
cctgtttctg gcggggggca tccgtggctc cggtgccaag gccgtctgct ccaacgaggt 240
cttctgctac aaccctctga ccaacatctg gagccaggtt cggcccatgc agcaggcccg 300
agcccagctc aagctggtgg ccctggacgg gctgctctat gccatcggtg gcgaatgcct 360
gtacagcatg gagtgctacg acccgcgaac agacgcctgg accccacgcg cgccactccc 420
cgcaggcacc ttccctgtgg cccacgaggc tgtggcctgc cgtggggaca tctacgtcac 480
cgggggtcac ctcttctacc gcctgctcag gtacagcccc gtgaaggatg cttgggacga 540
gtgcccatac agtgccagcc accggcgttc cagcgacatc gttgcactgg ggggcttcct 600
gtaccgcttc gacctgctgc ggggcgtggg cgccgccgtg atgcgctaca acacagtgac 660
cggctcctgg agcagggctg cctccctgcc cctgcccgcc cccgccccac tgcgctgcac 720
caccctgggc aacaccattt actgcctcaa cccccaggtc actgccacct tcacggtctc 780
tggggggact gcccagttcc aggccaagga gctgcagccc ttccccttgg ggagcaccgg 840
ggtcctcagt ccattcatcc tgactctgcc ccctgaggac cggctgcaga cctcactctg 900
agtggcaggc agagaaccaa agctgcttcg ctgctctcca gggagaccct cctgggatgg 960
gcctgagagg ccggggctca gggaaggggc tgggatcgga acttcctgct cttgtttctg 1020
gacaactttc cccttctgct ttaaaggttg tcgattattt tgaagcccag actccctcag 1080
cctctttctg cccctcactc cacacccaga ctgtttcctg actcaattcc gtacctactt 1140
acagaccctc tcagcttgct gacacccccc tgtctgtggg actccctatt ccctagagcc 1200
agggactgat gcgtctccac agacaaggac ttggctcgct ggagctctgc tgagccgaga 1260
gaggaggggg tagaaaacat tcacacttcc tatgctctgt cagcaggaca gggagcaaaa 1320
acgtccccag gcaacgccct cgcctctggg actttctgcc tgtcctaagg cctccccagg 1380
taccaacccc gtagctatct gggtctgttt ggcactgtgg attctcaagg gcctagaacc 1440
cttgcctctg aaactggtcc gctggtgcag ccctgctgtc tgcagctcct gcccataccc 1500
ccagcccaca ccaggccagg cccactccgg gctcaccacc ctctgcagcc ttgtggggct 1560
ctcccagccc ctccagaagc ccaccccact tctcgccaac ccccgatctc taaatgaggc 1620
ctgagcgtca ccctagttct gccccttttt agctgtgtag acttggacga gacatttgac 1680
ttccctttct ccttgtctat aaaatgtgga cagtggacgt ctgtcaccca agagagttgt 1740
gggagacaag atcacagcta tgagcacctc gcacggtgtc caggatgcac agcacaatcc 1800
atgatgcgtt ttctcccctt acgcactttg aaacccatgc tagaaaagtg aatacatctg 1860
actgtgctcc actccaacct ccagcctgga tgtccctgtc tgggcccttt ttctgttttt 1920
tattctatgt tcagcaccac tggcaccaaa tacattttaa ttca 1964



<210> 505 
<211> 732 
<212> DNA 
<213> Homo sapiens 



<400> 505 

atgcacaact acctgtttct ggcggggggc atccgtggct ccggtgccaa ggccgtctgc 60
tccaacgagg tcttctgcta caaccctctg accaacatct ggagccaggt tcggcccatg 120
cagcaggccc gagcccagct caagctggtg gccctggacg ggctgctcta tgccatcggt 180
ggcgaatgcc tgtacagcat ggagtgctac gacccgcgaa cagacgcctg gaccccacgc 240
gcgccactcc ccgcaggcac cttccctgtg gcccacgagg ctgtggcctg ccgtggggac 300
atctacgtca ccgggggtca cctcttctac cgcctgctca ggtacagccc cgtgaaggat 360
gcttgggacg agtgcccata cagtgccagc caccggcgtt ccagcgacat cgttgcactg 420
gggggcttcc tgtaccgctt cgacctgctg cggggcgtgg gcgccgccgt gatgcgctac 480
aacacagtga ccggctcctg gagcagggct gcctccctgc ccctgcccgc ccccgcccca 540
ctgcgctgca ccaccctggg caacaccatt tactgcctca acccccaggt cactgccacc 600
ttcacggtct ctggggggac tgcccagttc caggccaagg agctgcagcc cttccccttg 660
gggagcaccg gggtcctcag tccattcatc ctgactctgc cccctgagga ccggctgcag 720
acctcactct ga 732






<210> 506 
<211> 729 
<212> DNA 
<213> Homo sapiens 



<400> 506 

atgcacaact acctgtttct ggcggggggc atccgtggct ccggtgccaa ggccgtctgc 60
tccaacgagg tcttctgcta caaccctctg accaacatct ggagccaggt tcggcccatg 120
cagcaggccc gagcccagct caagctggtg gccctggacg ggctgctcta tgccatcggt 180
ggcgaatgcc tgtacagcat ggagtgctac gacccgcgaa cagacgcctg gaccccacgc 240
gcgccactcc ccgcaggcac cttccctgtg gcccacgagg ctgtggcctg ccgtggggac 300
atctacgtca ccgggggtca cctcttctac cgcctgctca ggtacagccc cgtgaaggat 360
gcttgggacg agtgcccata cagtgccagc caccggcgtt ccagcgacat cgttgcactg 420
gggggcttcc tgtaccgctt cgacctgctg cggggcgtgg gcgccgccgt gatgcgctac 480
aacacagtga ccggctcctg gagcagggct gcctccctgc ccctgcccgc ccccgcccca 540
ctgcgctgca ccaccctggg caacaccatt tactgcctca acccccaggt cactgccacc 600
ttcacggtct ctggggggac tgcccagttc caggccaagg agctgcagcc cttccccttg 660
gggagcaccg gggtcctcag tccattcatc ctgactctgc cccctgagga ccggctgcag 720
acctcactc 729



<210> 507 
<211> 243 
<212> PRT 
<213> Homo sapiens 



<400> 507 



Met His Asn Tyr Leu Phe Leu Ala Gly Gly Ile Arg Gly Ser Gly Ala

1 5 10 15

Lys Ala Val Cys Ser Asn Glu Val Phe Cys Tyr Asn Pro Leu Thr Asn

20 25 30

Ile Trp Ser Gln Val Arg Pro Met Gln Gln Ala Arg Ala Gln Leu Lys

35 40 45

Leu Val Ala Leu Asp Gly Leu Leu Tyr Ala Ile Gly Gly Glu Cys Leu

50 55 60

Tyr Ser Met Glu Cys Tyr Asp Pro Arg Thr Asp Ala Trp Thr Pro Arg

65 70 75 80

Ala Pro Leu Pro Ala Gly Thr Phe Pro Val Ala His Glu Ala Val Ala

85 90 95

Cys Arg Gly Asp Ile Tyr Val Thr Gly Gly His Leu Phe Tyr Arg Leu

100 105 110

Leu Arg Tyr Ser Pro Val Lys Asp Ala Trp Asp Glu Cys Pro Tyr Ser

115 120 125

Ala Ser His Arg Arg Ser Ser Asp Ile Val Ala Leu Gly Gly Phe Leu

130 135 140

Tyr Arg Phe Asp Leu Leu Arg Gly Val Gly Ala Ala Val Met Arg Tyr

145 150 155 160

Asn Thr Val Thr Gly Ser Trp Ser Arg Ala Ala Ser Leu Pro Leu Pro

165 170 175

Ala Pro Ala Pro Leu Arg Cys Thr Thr Leu Gly Asn Thr Ile Tyr Cys

180 185 190

Leu Asn Pro Gln Val Thr Ala Thr Phe Thr Val Ser Gly Gly Thr Ala

195 200 205

Gln Phe Gln Ala Lys Glu Leu Gln Pro Phe Pro Leu Gly Ser Thr Gly

210 215 220

Val Leu Ser Pro Phe Ile Leu Thr Leu Pro Pro Glu Asp Arg Leu Gln

225 230 235 240

Thr Ser Leu



<210> 508
<211> 158
<212> PRT

<213> Homo sapiens

<400> 508

Met His Asn Tyr Leu Phe Leu Ala Gly Gly Ile Arg Gly Ser Gly Ala

1 5 10 15

Lys Ala Val Cys Ser Asn Glu Val Phe Cys Tyr Asn Pro Leu Thr Asn

20 25 30

Ile Trp Ser Gln Val Arg Pro Met Gln Gln Ala Arg Ala Gln Leu Lys

35 40 45

Leu Val Ala Leu Asp Gly Leu Leu Tyr Ala Ile Gly Gly Glu Cys Leu

50 55 60

Tyr Ser Met Glu Cys Tyr Asp Pro Arg Thr Asp Ala Trp Thr Pro Arg
65 70 75 80

Ala Pro Leu Pro Ala Gly Thr Phe Pro Val Ala His Glu Ala Val Ala

85 90 95

Cys Arg Gly Asp Ile Tyr Val Thr Gly Gly His Leu Phe Tyr Arg Leu

100 105 110

Leu Arg Tyr Ser Pro Val Lys Asp Ala Trp Asp Glu Cys Pro Tyr Ser

115 120 125

Ala Ser His Arg Arg Ser Ser Asp Ile Val Ala Leu Gly Gly Phe Leu

130 135 140

Tyr Arg Phe Asp Leu Leu Arg Gly Val Gly Ala Ala Val Met
145 150 155



<210> 509
<211> 85
<212> PRT

<213> Homo sapiens

<400> 509
Arg Tyr Asn Thr Val Thr Gly Ser Trp Ser Arg Ala Ala Ser Leu Pro

1 5 10 15

Leu Pro Ala Pro Ala Pro Leu Arg Cys Thr Thr Leu Gly Asn Thr Ile

20 25 30

Tyr Cys Leu Asn Pro Gln Val Thr Ala Thr Phe Thr Val Ser Gly Gly

35 40 45

Thr Ala Gln Phe Gln Ala Lys Glu Leu Gln Pro Phe Pro Leu Gly Ser

50 55 60

Thr Gly Val Leu Ser Pro Phe Ile Leu Thr Leu Pro Pro Glu Asp Arg
65 70 75 80

Leu Gln Thr Ser Leu
85



<210> 510
<211> 732
<212> DNA
<213> Homo sapiens



<400> 510

atgcgacccc agggccccgc cgcctccccg cagcggctcc gcggcctcct gctgctcctg 60
ctgctgcagc tgcccgcgcc gtcgagcgcc tctgagatcc ccaaggggaa gcaaaaggcg 120
cagctccggc agagggaggt ggtggacctg tataatggaa tgtgcttaca agggccagca 180
ggagtgcctg gtcgagacgg gagccctggg gccaatgtta ttccgggtac acctgggatc 240
ccaggtcggg atggattcaa aggagaaaag ggggaatgtc tgagggaaag ctttgaggag 300
tcctggacac ccaactacaa gcagtgttca tggagttcat tgaattatgg catagatctt 360
gggaaaattg cggagtgtac atttacaaag atgcgttcaa atagtgctct aagagttttg 420
ttcagtggct cacttcggct aaaatgcaga aatgcatgct gtcagcgttg gtatttcaca 480
ttcaatggag ctgaatgttc aggacctctt cccattgaag ctataattta tttggaccaa 540
ggaagccctg aaatgaattc aacaattaat attcatcgca cttcttctgt ggaaggactt 600
tgtgaaggaa ttggtgctgg attagtggat gttgctatct gggttggcac ttgttcagat 660
tacccaaaag gagatgcttc tactggatgg aattcagttt ctcgcatcat tattgaagaa 720
ctaccaaaat aa 732



<210> 511 
<211> 729 
<212> DNA 
<213> Homo sapiens 

<400> 511 

atgcgacccc agggccccgc cgcctccccg cagcggctcc gcggcctcct gctgctcctg 60
ctgctgcagc tgcccgcgcc gtcgagcgcc tctgagatcc ccaaggggaa gcaaaaggcg 120
cagctccggc agagggaggt ggtggacctg tataatggaa tgtgcttaca agggccagca 180
ggagtgcctg gtcgagacgg gagccctggg gccaatgtta ttccgggtac acctgggatc 240
ccaggtcggg atggattcaa aggagaaaag ggggaatgtc tgagggaaag ctttgaggag 300
tcctggacac ccaactacaa gcagtgttca tggagttcat tgaattatgg catagatctt 360
gggaaaattg cggagtgtac atttacaaag atgcgttcaa atagtgctct aagagttttg 420
ttcagtggct cacttcggct aaaatgcaga aatgcatgct gtcagcgttg gtatttcaca 480
ttcaatggag ctgaatgttc aggacctctt cccattgaag ctataattta tttggaccaa 540
ggaagccctg aaatgaattc aacaattaat attcatcgca cttcttctgt ggaaggactt 600
tgtgaaggaa ttggtgctgg attagtggat gttgctatct gggttggcac ttgttcagat 660
tacccaaaag gagatgcttc tactggatgg aattcagttt ctcgcatcat tattgaagaa 720
ctaccaaaa 729

<210> 512
<211> 837
<212> DNA
<213> Homo sapiens



<400> 512 

atgcagcctg cggcggcctc ggagcgcggc ggagcagacg ctgaccacgt tcctctcctc 60 
ggtctcctcc gcctccagct ccgcgctgcc cggcagccgg gagccatgcg accccagggc 120 
cccgccgcct ccccgcagcg gctccgcggc ctcctgctgc tcctgctgct gcagctgccc 180 
gcgccgtcga gcgcctctga gatccccaag gggaagcaaa aggcgcagct ccggcagagg 240 
gaggtggtgg acctgtataa tggaatgtgc ttacaagggc cagcaggagt gcctggtcga 300 
gacgggagcc ctggggccaa tgttattccg ggtacacctg ggatcccagg tcgggatgga 360 
ttcaaaggag aaaaggggga atgtctgagg gaaagctttg aggagtcctg gacacccaac 420 
tacaagcagt gttcatggag ttcattgaat tatggcatag atcttgggaa aattgcggag 480 
tgtacattta caaagatgcg ttcaaatagt gctctaagag ttttgttcag tggctcactt 540 
cggctaaaat gcagaaatgc atgctgtcag cgttggtatt tcacattcaa tggagctgaa 600 
tgttcaggac ctcttcccat tgaagctata atttatttgg accaaggaag ccctgaaatg 660 
aattcaacaa ttaatattca tcgcacttct tctgtggaag gactttgtga aggaattggt 720 
gctggattag tggatgttgc tatctgggtt ggcacttgtt cagattaccc aaaaggagat 780 
gcttctactg gatggaattc agtttctcgc atcattattg aagaactacc aaaataa 837

<210> 513
<211> 837
<212> DNA
<213> Homo sapiens
<400> 513

atgcagcctg cggcggcctc ggagcgcggc ggagcagacg ctgaccacgt tcctctcctc 60 
ggtctcctcc gcctccagct ccgcgctgcc cggcagccgg gagccatgcg accccagggc 120 
cccgccgcct ccccgcagcg gctccgcggc ctcctgctgc tcctgctgct gcagctgccc 180 
gcgccgtcga gcgcctctga gatccccaag gggaagcaaa aggcgcagct ccggcagagg 240 
gaggtggtgg acctgtataa tggaatgtgc ttacaagggc cagcaggagt gcctggtcga 300 
gacgggagcc ctggggccaa tgttattccg ggtacacctg ggatcccagg tcgggatgga 360 
ttcaaaggag aaaaggggga atgtctgagg gaaagctttg aggagtcctg gacacccaac 420 
tacaagcagt gttcatggag ttcattgaat tatggcatag atcttgggaa aattgcggag 480 
tgtacattta caaagatgcg ttcaaatagt gctctaagag ttttgttcag tggctcactt 540 
cggctaaaat gcagaaatgc atgctgtcag cgttggtatt tcacattcaa tggagctgaa 600 
tgttcaggac ctcttcccat tgaagctata atttatttgg accaaggaag ccctgaaatg 660 
aattcaacaa ttaatattca tcgcacttct tctgtggaag gactttgtga aggaattggt 720 
gctggattag tggatgttgc tatctgggtt ggcacttgtt cagattaccc aaaaggagat 780 
gcttctactg gatggaattc agtttctcgc atcattattg aagaactacc aaaataa 837 

<210> 514 
<211> 243 
<212> PRT 
<213> Homo sapiens 



<400> 514 



Met Arg Pro Gln Gly Pro Ala Ala Ser Pro Gln Arg Leu Arg Gly Leu

1 5 10 15

Leu Leu Leu Leu Leu Leu Gln Leu Pro Ala Pro Ser Ser Ala Ser Glu

20 25 30

Ile Pro Lys Gly Lys Gln Lys Ala Gln Leu Arg Gln Arg Glu Val Val

35 40 45

Asp Leu Tyr Asn Gly Met Cys Leu Gln Gly Pro Ala Gly Val Pro Gly

50 55 60

Arg Asp Gly Ser Pro Gly Ala Asn Val Ile Pro Gly Thr Pro Gly Ile

65 70 75 80

Pro Gly Arg Asp Gly Phe Lys Gly Glu Lys Gly Glu Cys Leu Arg Glu

85 90 95

Ser Phe Glu Glu Ser Trp Thr Pro Asn Tyr Lys Gln Cys Ser Trp Ser

100 105 110

Ser Leu Asn Tyr Gly Ile Asp Leu Gly Lys Ile Ala Glu Cys Thr Phe

115 120 125

Thr Lys Met Arg Ser Asn Ser Ala Leu Arg Val Leu Phe Ser Gly Ser

130 135 140

Leu Arg Leu Lys Cys Arg Asn Ala Cys Cys Gln Arg Trp Tyr Phe Thr

145 150 155 160

Phe Asn Gly Ala Glu Cys Ser Gly Pro Leu Pro Ile Glu Ala Ile Ile

165 170 175

Tyr Leu Asp Gln Gly Ser Pro Glu Met Asn Ser Thr Ile Asn Ile His

180 185 190

Arg Thr Ser Ser Val Glu Gly Leu Cys Glu Gly Ile Gly Ala Gly Leu

195 200 205

Val Asp Val Ala Ile Trp Val Gly Thr Cys Ser Asp Tyr Pro Lys Gly

210 215 220

Asp Ala Ser Thr Gly Trp Asn Ser Val Ser Arg Ile Ile Ile Glu Glu

225 230 235 240

Leu Pro Lys



<210> 515
<211> 278
<212> PRT

<213> Homo sapiens 

<400> 515 



Met Gln Pro Ala Ala Ala Ser Glu Arg Gly Gly Ala Asp Ala Asp His

1 5 10 15

Val Pro Leu Leu Gly Leu Leu Arg Leu Gln Leu Arg Ala Ala Arg Gln

20 25 30

Pro Gly Ala Met Arg Pro Gln Gly Pro Ala Ala Ser Pro Gln Arg Leu

35 40 45

Arg Gly Leu Leu Leu Leu Leu Leu Leu Gln Leu Pro Ala Pro Ser Ser

50 55 60

Ala Ser Glu Ile Pro Lys Gly Lys Gln Lys Ala Gln Leu Arg Gln Arg

65 70 75 80

Glu Val Val Asp Leu Tyr Asn Gly Met Cys Leu Gln Gly Pro Ala Gly

85 90 95

Val Pro Gly Arg Asp Gly Ser Pro Gly Ala Asn Val Ile Pro Gly Thr

100 105 110

Pro Gly Ile Pro Gly Arg Asp Gly Phe Lys Gly Glu Lys Gly Glu Cys

115 120 125

Leu Arg Glu Ser Phe Glu Glu Ser Trp Thr Pro Asn Tyr Lys Gln Cys

130 135 140

Ser Trp Ser Ser Leu Asn Tyr Gly Ile Asp Leu Gly Lys Ile Ala Glu

145 150 155 160

Cys Thr Phe Thr Lys Met Arg Ser Asn Ser Ala Leu Arg Val Leu Phe

165 170 175

Ser Gly Ser Leu Arg Leu Lys Cys Arg Asn Ala Cys Cys Gln Arg Trp

180 185 190

Tyr Phe Thr Phe Asn Gly Ala Glu Cys Ser Gly Pro Leu Pro Ile Glu

195 200 205

Ala Ile Ile Tyr Leu Asp Gln Gly Ser Pro Glu Met Asn Ser Thr Ile

210 215 220

Asn Ile His Arg Thr Ser Ser Val Glu Gly Leu Cys Glu Gly Ile Gly

225 230 235 240

Ala Gly Leu Val Asp Val Ala Ile Trp Val Gly Thr Cys Ser Asp Tyr

245 250 255

Pro Lys Gly Asp Ala Ser Thr Gly Trp Asn Ser Val Ser Arg Ile Ile

260 265 270

Ile Glu Glu Leu Pro Lys

275













<210> 516 
<211> 197 
<212> PRT 
<213> Homo sapiens 

<400> 516 

Met Arg Pro Gln Gly Pro Ala Ala Ser Pro Gln Arg Leu Arg Gly Leu 
5 10 15 

Leu Leu Leu Leu Leu Leu Gln Leu Pro Ala Pro Ser Ser Ala Ser Glu 
20 25 30 

Ile Pro Lys Gly Lys Gln Lys Ala Gln Leu Arg Gln Arg Glu Val Val 
35 40 45 

Asp Leu Tyr Asn Gly Met Cys Leu Gln Gly Pro Ala Gly Val Pro Gly 
50 55 60 

Arg Asp Gly Ser Pro Gly Ala Asn Val Ile Pro Gly Thr Pro Gly Ile 
65 70 75 80 

Pro Gly Arg Asp Gly Phe Lys Gly Glu Lys Gly Glu Cys Leu Arg Glu 
85 90 95 

Ser Phe Glu Glu Ser Trp Thr Pro Asn Tyr Lys Gln Cys Ser Trp Ser 
100 105 110 

Ser Leu Asn Tyr Gly Ile Asp Leu Gly Lys Ile Ala Glu Cys Thr Phe 
115 120 125 

Thr Lys Met Arg Ser Asn Ser Ala Leu Arg Val Leu Phe Ser Gly Ser 
130 135 140 

Leu Arg Leu Lys Cys Arg Asn Ala Cys Cys Gln Arg Trp Tyr Phe Thr 
145 150 155 160 

Phe Asn Gly Ala Glu Cys Ser Gly Pro Leu Pro Ile Glu Ala Ile Ile 
165 170 175 

Tyr Leu Asp Gln Gly Ser Pro Glu Met Asn Ser Thr Ile Asn Ile His 
180 185 190 

Arg Thr Ser Ser Val 
195 

<210> 517 
<211> 232 
<212> PRT 
<213> Homo sapiens 

<400> 517 

Met Gln Pro Ala Ala Ala Ser Glu Arg Gly Gly Ala Asp Ala Asp His 
5 10 15 

Val Pro Leu Leu Gly Leu Leu Arg Leu Gln Leu Arg Ala Ala Arg Gln 
20 25 30 

Pro Gly Ala Met Arg Pro Gln Gly Pro Ala Ala Ser Pro Gln Arg Leu 
35 40 45 

Arg Gly Leu Leu Leu Leu Leu Leu Leu Gln Leu Pro Ala Pro Ser Ser 
50 55 60 

Ala Ser Glu Ile Pro Lys Gly Lys Gln Lys Ala Gln Leu Arg Gln Arg 
65 70 75 80 

Glu Val Val Asp Leu Tyr Asn Gly Met Cys Leu Gln Gly Pro Ala Gly 
85 90 95 

Val Pro Gly Arg Asp Gly Ser Pro Gly Ala Asn Val Ile Pro Gly Thr 
100 105 110 

Pro Gly Ile Pro Gly Arg Asp Gly Phe Lys Gly Glu Lys Gly Glu Cys 
115 120 125 

Leu Arg Glu Ser Phe Glu Glu Ser Trp Thr Pro Asn Tyr Lys Gln Cys 
130 135 140 

Ser Trp Ser Ser Leu Asn Tyr Gly Ile Asp Leu Gly Lys Ile Ala Glu 
145 150 155 160 

Cys Thr Phe Thr Lys Met Arg Ser Asn Ser Ala Leu Arg Val Leu Phe 
165 170 175 

Ser Gly Ser Leu Arg Leu Lys Cys Arg Asn Ala Cys Cys Gln Arg Trp 
180 185 190 

Tyr Phe Thr Phe Asn Gly Ala Glu Cys Ser Gly Pro Leu Pro Ile Glu 
195 200 205 

Ala Ile Ile Tyr Leu Asp Gln Gly Ser Pro Glu Met Asn Ser Thr Ile 
210 215 220 

Asn Ile His Arg Thr Ser Ser Val 
225 230 

<210> 518 
<211> 46 
<212> PRT 
<213> Homo sapiens 

<400> 518 

Glu Gly Leu Cys Glu Gly Ile Gly Ala Gly Leu Val Asp Val Ala Ile 
5 10 15 

Trp Val Gly Thr Cys Ser Asp Tyr Pro Lys Gly Asp Ala Ser Thr Gly 
20 25 30 

Trp Asn Ser Val Ser Arg Ile Ile Ile Glu Glu Leu Pro Lys 
35 40 45 



<210> 519 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 519 

Cys Ser Asp Tyr Pro Lys Gly Asp Ala Ser Thr Gly Trp Asn Ser Val 

5 10 15 

Ser Arg Ile Ile Ile Glu Glu Leu Pro Lys 
20 25 



<210> 520 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 520 

aaaaatgagg agatatttaa ttacaataac catttaaaaa accgtatata tcaatatgaa 60 

<210> 521 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 521 

atgcaacatc atctcctaaa agagaaaaat gaggagatat ttaattacaa taaccattta 60 

<210> 522 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 522 

gacaacaaaa gcaagataac aattgatatt cattttcttg agaggaaaat gcaacatcat 60 

<210> 523 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 523 

aaaaatatgt ggcttcaaca gcaattagtt catgcacata agaaagctga caacaaaagc 60 

<210> 524 
<211> 63 
<212> DNA 

<213> Homo sapiens 

<400> 524 

gatcagaaat tatttcaact acaaagcaaa aatatgtggc ttcaacagca attagttcat 60 
gca 63 

<210> 525 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 525 

actgaacagc aggagtctct agatcagaaa ttatttcaac tacaaagcaa aaatatgtgg 60 

<210> 526 

<211> 63 

<212> DNA 

<213> Homo sapiens 

<400> 526 

gctcaaagga aatccaaaag cctaaaaatt aatctcaatt atgccggaga tgctctaaga 60 
gaa 63 

<210> 527 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 527 

agtacgatat ataacaatga ggtgctccat caaccacttt ctgaagctca aaggaaatcc 60 

<210> 528 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 528 

agaaaaatga atgttgatgt gagtagtacg atatataaca atgaggtgct ccatcaacca 60 

<210> 529 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 529 

attgcaggag atgcttgttt gcaaagaaaa atgaatgttg atgtgagtag tacgatatat 60 

<210> 530 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 530 

aaaagtcaag aacctgcttt ccacattgca ggagatgctt gtttgcaaag aaaaatgaat 60 



<210> 531 
<211> 60 
<212> DNA 

<213> Homo sapiens 

<400> 531 

gaaaataaat actttgagga cattaagatt ttaaaagaaa agaatgctga acttcagatg 60 

<210> 532 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 532 

ctgaaacacc aataccagga aaaggaaaat aaatactttg aggacattaa gattttaaaa 60 

<210> 533 

<211> 63 

<212> DNA 

<213> Homo sapiens 

<400> 533 

aattgcatgt tgaaaaagga aattgccatg ctaaaactgg aaatagccac actgaaacac 60 
caa 63 

<210> 534 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 534 

Asn Cys Met Leu Lys Lys Glu Ile Ala Met Leu Lys Leu Glu Ile Ala 

5 10 15 

Thr Leu Lys His Gln 
20 



<210> 535 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 535 

Leu Lys His Gln Tyr Gln Glu Lys Glu Asn Lys Tyr Phe Glu Asp Ile 
5 10 15 

Lys Ile Leu Lys 
20 



<210> 536 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 536 

Glu Asn Lys Tyr Phe Glu Asp Ile Lys Ile Leu Lys Glu Lys Asn Ala 
5 10 15 

Glu Leu Gln Met 
20 

<210> 537 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 537 

Lys Ser Gln Glu Pro Ala Phe His Ile Ala Gly Asp Ala Cys Leu Gln 
5 10 15 

Arg Lys Met Asn 
20 



<210> 538 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 538 

Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn Val Asp Val Ser 
5 10 15 

Ser Thr Ile Tyr 
20 



<210> 539 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 539 

Arg Lys Met Asn Val Asp Val Ser Ser Thr Ile Tyr Asn Asn Glu Val 
5 10 15 

Leu His Gln Pro 
20 



<210> 540 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 540 

Ser Thr Ile Tyr Asn Asn Glu Val Leu His Gln Pro Leu Ser Glu Ala 
5 10 15 

Gln Arg Lys Ser 
20 



<210> 541 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 541 

Ala Gln Arg Lys Ser Lys Ser Leu Lys Ile Asn Leu Asn Tyr Ala Gly 

5 10 15 

Asp Ala Leu Arg Glu 
20 



<210> 542 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 542 

Thr Glu Gln Gln Glu Ser Leu Asp Gln Lys Leu Phe Gln Leu Gln Ser 
5 10 15 

Lys Asn Met Trp 
20 



<210> 543 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 543 

Asp Gln Lys Leu Phe Gln Leu Gln Ser Lys Asn Met Trp Leu Gln Gln 

5 10 15 

Gln Leu Val His Ala 
20 



<210> 544 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 544 

Lys Asn Met Trp Leu Gln Gln Gln Leu Val His Ala His Lys Lys Ala 
5 10 15 

Asp Asn Lys Ser 
20 



<210> 545 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 545 

Asp Asn Lys Ser Lys Ile Thr Ile Asp Ile His Phe Leu Glu Arg Lys 
5 10 15 

Met Gln His His 
20 



<210> 546 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 546 

Met Gln His His Leu Leu Lys Glu Lys Asn Glu Glu Ile Phe Asn Tyr 
5 10 15 

Asn Asn His Leu 
20 



<210> 547 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 547 

Lys Asn Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile 
5 10 15 

Tyr Gln Tyr Glu 
20 



<210> 548 

<211> 3045 

<212> DNA 

<213> Homo sapiens 

<400> 548 

atgcagcatc accaccatca ccacgtcggc tccatgagtc ccgcaaaaga aacatctgag 60 
aaatttacgt gggcagcaaa aggaagacct aggaagatcg catgggagaa aaaagaaaca 120 
cctgtaaaga ctggatgcgt ggcaagagta acatctaata aaactaaagt tttggaaaaa 180 
ggaagatcta agatgattgc atgtcctaca aaagaatcat ctacaaaagc aagtgccaat 240 
gatcagaggt tcccatcaga atccaaacaa gaggaagatg aagaatattc ttgtgattct 300 
cggagtctct ttgagagttc tgcaaagatt caagtgtgta tacctgagtc tatatatcaa 360 
aaagtaatgg agataaatag agaagtagaa gagcctccta agaagccatc tgccttcaag 420 
cctgccattg aaatgcaaaa ctctgttcca aataaagcct ttgaattgaa gaatgaacaa 480 
acattgagag cagatccgat gttcccacca gaatccaaac aaaaggacta tgaagaaaat 540 
tcttgggatt ctgagagtct ctgtgagact gtttcacaga aggatgtgtg tttacccaag 600 
gctacacatc aaaaagaaat agataaaata aatggaaaat tagaagagtc tcctaataaa 660 
gatggtcttc tgaaggctac ctgcggaatg aaagtttcta ttccaactaa agccttagaa 720 
ttgaaggaca tgcaaacttt caaagcagag cctccgggga agccatctgc cttcgagcct 780 
gccactgaaa tgcaaaagtc tgtcccaaat aaagccttgg aattgaaaaa tgaacaaaca 840 
ttgagagcag atgagatact cccatcagaa tccaaacaaa aggactatga agaaaattct 900 
tgggatactg agagtctctg tgagactgtt tcacagaagg atgtgtgttt acccaaggct 960 
gcgcatcaaa aagaaataga taaaataaat ggaaaattag aagggtctcc tggtaaagat 1020 
ggtcttctta aggctaactg cggaatgaaa gtttctattc caactaaagc cttagaattg 1080 
atggacatgc aaactttcaa agcagagcct cccgagaagc catctgcctt cgagcctgcc 1140 
attgaaatgc aaaagtctgt tccaaataaa gccttggaat tgaagaatga acaaacattg 1200 
agagcagatg agatactccc atcagaatcc aaacaaaagg actatgaaga aagttcttgg 1260 
gattctgaga gtctctgtga gactgtttca cagaaggatg tgtgtttacc caaggctgcg 1320 
catcaaaaag aaatagataa aataaatgga aaattagaag agtctcctga taatgatggt 1380 
tttctgaagt ctccctgcag aatgaaagtt tctattccaa ctaaagcctt agaattgatg 1440 
gacatgcaaa ctttcaaagc agagcctccc gagaagccat ctgccttcga gcctgccatt 1500 
gaaatgcaaa agtctgttcc aaataaagcc ttggaattga agaatgaaca aacattgaga 1560 
gcagatcaga tgttcccttc agaatcaaaa caaaagaacg ttgaagaaaa ttcttgggat 1620 
tctgagagtc tccgtgagac tgtttcacag aaggatgtgt gtgtacccaa ggctacacat 1680 
caaaaagaaa tggataaaat aagtggaaaa ttagaagatt caactagcct atcaaaaatc 1740 
ttggatacag ttcattcttg tgaaagagca agggaacttc aaaaagatca ctgtgaacaa 1800 
cgtacaggaa aaatggaaca aatgaaaaag aagttttgtg tactgaaaaa gaaactgtca 1860 
gaagcaaaag aaataaaatc acagttagag aaccaaaaag ttaaatggga acaagagctc 1920 
tgcagtgtga gattgacttt aaaccaagaa gaagagaaga gaagaaatgc cgatatatta 1980 
aatgaaaaaa ttagggaaga attaggaaga atcgaagagc agcataggaa agagttagaa 2040 
gtgaaacaac aacttgaaca ggctctcaga atacaagata tagaattgaa gagtgtagaa 2100 
agtaatttaa atcaggtttc tcacactcat gaaaatgaaa attatctctt acatgaaaat 2160 
tgcatgttga aaaaggaaat tgccatgcta aaactggaaa tagccacact gaaacaccaa 2220 
taccaggaaa aggaaaataa atactttgag gacattaaga ttttaaaaga aaagaatgct 2280 
gaacttcaga tgaccctaaa actgaaagag gaatcattaa ctaaaagggc atctcaatat 2340 
agtgggcagc ttaaagttct gatagctgag aacacaatgc tcacttctaa attgaaggaa 2400 
aaacaagaca aagaaatact agaggcagaa attgaatcac accatcctag actggcttct 2460 
gctgtacaag accatgatca aattgtgaca tcaagaaaaa gtcaagaacc tgctttccac 2520 
attgcaggag atgcttgttt gcaaagaaaa atgaatgttg atgtgagtag tacgatatat 2580 
aacaatgagg tgctccatca accactttct gaagctcaaa ggaaatccaa aagcctaaaa 2640 
attaatctca attatgccgg agatgctcta agagaaaata cattggtttc agaacatgca 2700 
caaagagacc aacgtgaaac acagtgtcaa atgaaggaag ctgaacacat gtatcaaaac 2760 
gaacaagata atgtgaacaa acacactgaa cagcaggagt ctctagatca gaaattattt 2820 
caactacaaa gcaaaaatat gtggcttcaa cagcaattag ttcatgcaca taagaaagct 2880 
gacaacaaaa gcaagataac aattgatatt cattttcttg agaggaaaat gcaacatcat 2940 
ctcctaaaag agaaaaatga ggagatattt aattacaata accatttaaa aaaccgtata 3000 
tatcaatatg aaaaagagaa agcagaaaca gaagttatat aatag 3045 

<210> 549 

<211> 1953 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 985 

<223> n = A,T,C or G 
<400> 549 

atgagtcccg caaaagaaac atctgagaaa tttacgtggg cagcaaaagg aagacctagg 60 
aagatcgcat gggagaaaaa agaaacacct gtaaagactg gatgcgtggc aagagtaaca 120 
tctaataaaa ctaaagtttt ggaaaaagga agatctaaga tgattgcatg tcctacaaaa 180 
gaatcatcta caaaagcaag tgccaatgat cagaggttcc catcagaatc caaacaagag 240 
gaagatgaag aatattcttg tgattctcgg agtctctttg agagttctgc aaagattcaa 300 
gtgtgtatac ctgagtctat atatcaaaaa gtaatggaga taaatagaga agtagaagag 360 
cctcctaaga agccatctgc cttcaagcct gccattgaaa tgcaaaactc tgttccaaat 420 
aaagcctttg aattgaagaa tgaacaaaca ttgagagcag atccgatgtt cccaccagaa 480 
tccaaacaaa aggactatga agaaaattct tgggattctg agagtctctg tgagactgtt 540 
tcacagaagg atgtgtgttt acccaaggct acacatcaaa aagaaataga taaaataaat 600 
ggaaaattag aagagtctcc taataaagat ggtcttctga aggctacctg cggaatgaaa 660 
gtttctattc caactaaagc cttagaattg aaggacatgc aaactttcaa agcagagcct 720 
ccggggaagc catctgcctt cgagcctgcc actgaaatgc aaaagtctgt cccaaataaa 780 
gccttggaat tgaaaaatga acaaacattg agagcagatg agatactccc atcagaatcc 840 
aaacaaaagg actatgaaga aaattcttgg gatactgaga gtctctgtga gactgtttca 900 
cagaaggatg tgtgtttacc caaggctgcg catcaaaaag aaatagataa aataaatgga 960 
aaattagaag ggtctcctgg taaanatggt cttctgaagg ctaactgcgg aatgaaagtt 1020 
tctattccaa ctaaagcctt agaattgatg gacatgcaaa ctttcaaagc agagcctccc 1080 
gagaagccat ctgccttcga gcctgccatt gaaatgcaaa agtctgttcc aaataaagcc 1140 
ttggaattga agaatgaaca aacattgaga gcagatgaga tactcccatc agaatccaaa 1200 
caaaaggact atgaagaaag ttcttgggat tctgagagtc tctgtgagac tgtttcacag 1260 
aaggatgtgt gtttacccaa ggctgcgcat caaaaagaaa tagataaaat aaatggaaaa 1320 
ttagaagagt ctcctgataa tgatggtttt ctgaagtctc cctgcagaat gaaagtttct 1380 
attccaacta aagccttaga attgatggac atgcaaactt tcaaagcaga gcctcccgag 1440 
aagccatctg ccttcgagcc tgccattgaa atgcaaaagt ctgttccaaa taaagccttg 1500 
gaattgaaga atgaacaaac attgagagca gatcagatgt tcccttcaga atcaaaacaa 1560 
aagaacgttg aagaaaattc ttgggattct gagagtctcc gtgagactgt ttcacagaag 1620 
gatgtgtgtg tacccaaggc tacacatcaa aaagaaatgg ataaaataag tggaaaatta 1680 
gaagattcaa ctagcctatc aaaaatcttg gatacagttc attcttgtga aagagcaagg 1740 
gaacttcaaa aagatcactg tgaacaacgt acaggaaaaa tggaacaaat gaaaaagaag 1800 
ttttgtgtac tgaaaaagaa actgtcagaa gcaaaagaaa taaaatcaca gttagagaac 1860 
caaaaagtta aatgggaaca agagctctgc agtgtgaggt ttctcacact catgaaaatg 1920 
aaaattatct cttacatgaa aattgcatgt tga 1953 



<210> 550 
<211> 978 
<212> DNA 
<213> Homo sapiens 



<400> 550 

atgcagcatc accaccatca ccacggcaca agagctctgc agtgtgaggt ttctcacact 60 
catgaaaatg aaaattatct cttacatgaa aattgcatgt tgaaaaagga aattgccatg 120 
ctaaaactgg aaatagccac actgaaacac caataccagg aaaaggaaaa taaatacttt 180 
gaggacatta agattttaaa agaaaagaat gctgaacttc agatgaccct aaaactgaaa 240 
gaggaatcat taactaaaag ggcatctcaa tatagtgggc agcttaaagt tctgatagct 300 
gagaacacaa tgctcacttc taaattgaag gaaaaacaag acaaagaaat actagaggca 360 
gaaattgaat cacaccatcc tagactggct tctgctgtac aagaccatga tcaaattgtg 420 
acatcaagaa aaagtcaaga acctgctttc cacattgcag gagatgcttg tttgcaaaga 480 
aaaatgaatg ttgatgtgag tagtacgata tataacaatg aggtgctcca tcaaccactt 540 
tctgaagctc aaaggaaatc caaaagccta aaaattaatc tcaattatgc cggagatgct 600 
ctaagagaaa atacattggt ttcagaacat gcacaaagag accaacgtga aacacagtgt 660 
caaatgaagg aagctgaaca catgtatcaa aacgaacaag ataatgtgaa caaacacact 720 
gaacagcagg agtctctaga tcagaaatta tttcaactac aaagcaaaaa tatgtggctt 780 
caacagcaat tagttcatgc acataagaaa gctgacaaca aaagcaagat aacaattgat 840 
attcattttc ttgagaggaa aatgcaacat catctcctaa aagagaaaaa tgaggagata 900 
tttaattaca ataaccattt aaaaaaccgt atatatcaat atgaaaaaga gaaagcagaa 960 
acagaagtta tataatag 978 



<210> 551 
<211> 324 
<212> PRT 
<213> Homo sapiens 



<400> 551 

Met Gln His His His His His His Gly Thr Arg Ala Leu Gln Cys Glu 
5 10 15 

Val Ser His Thr His Glu Asn Glu Asn Tyr Leu Leu His Glu Asn Cys 
20 25 30 

Met Leu Lys Lys Glu Ile Ala Met Leu Lys Leu Glu Ile Ala Thr Leu 
35 40 45 

Lys His Gln Tyr Gln Glu Lys Glu Asn Lys Tyr Phe Glu Asp Ile Lys 
50 55 60 

Ile Leu Lys Glu Lys Asn Ala Glu Leu Gln Met Thr Leu Lys Leu Lys 
65 70 75 80 

Glu Glu Ser Leu Thr Lys Arg Ala Ser Gln Tyr Ser Gly Gln Leu Lys 
85 90 95 

Val Leu Ile Ala Glu Asn Thr Met Leu Thr Ser Lys Leu Lys Glu Lys 
100 105 110 

Gln Asp Lys Glu Ile Leu Glu Ala Glu Ile Glu Ser His His Pro Arg 
115 120 125 

Leu Ala Ser Ala Val Gln Asp His Asp Gln Ile Val Thr Ser Arg Lys 
130 135 140 

Ser Gln Glu Pro Ala Phe His Ile Ala Gly Asp Ala Cys Leu Gln Arg 
145 150 155 160 

Lys Met Asn Val Asp Val Ser Ser Thr Ile Tyr Asn Asn Glu Val Leu 
165 170 175 

His Gln Pro Leu Ser Glu Ala Gln Arg Lys Ser Lys Ser Leu Lys Ile 
180 185 190 

Asn Leu Asn Tyr Ala Gly Asp Ala Leu Arg Glu Asn Thr Leu Val Ser 
195 200 205 

Glu His Ala Gln Arg Asp Gln Arg Glu Thr Gln Cys Gln Met Lys Glu 
210 215 220 

Ala Glu His Met Tyr Gln Asn Glu Gln Asp Asn Val Asn Lys His Thr 
225 230 235 240 

Glu Gln Gln Glu Ser Leu Asp Gln Lys Leu Phe Gln Leu Gln Ser Lys 
245 250 255 

Asn Met Trp Leu Gln Gln Gln Leu Val His Ala His Lys Lys Ala Asp 
260 265 270 

Asn Lys Ser Lys Ile Thr Ile Asp Ile His Phe Leu Glu Arg Lys Met 
275 280 285 

Gln His His Leu Leu Lys Glu Lys Asn Glu Glu Ile Phe Asn Tyr Asn 
290 295 300 

Asn His Leu Lys Asn Arg Ile Tyr Gln Tyr Glu Lys Glu Lys Ala Glu 
305 310 315 320 

Thr Glu Val Ile 



<210> 552 
<211> 661 
<212> PRT 

<213> Homo sapiens 
<400> 552 

Met Gln His His His His His His Val Gly Ser Met Ser Pro Ala Lys 
5 10 15 

Glu Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys Gly Arg Pro Arg Lys 
20 25 30 

Ile Ala Trp Glu Lys Lys Glu Thr Pro Val Lys Thr Gly Cys Val Ala 
35 40 45 

Arg Val Thr Ser Asn Lys Thr Lys Val Leu Glu Lys Gly Arg Ser Lys 
50 55 60 

Met Ile Ala Cys Pro Thr Lys Glu Ser Ser Thr Lys Ala Ser Ala Asn 
65 70 75 80 

Asp Gln Arg Phe Pro Ser Glu Ser Lys Gln Glu Glu Asp Glu Glu Tyr 
85 90 95 

Ser Cys Asp Ser Arg Ser Leu Phe Glu Ser Ser Ala Lys Ile Gln Val 
100 105 110 

Cys Ile Pro Glu Ser Ile Tyr Gln Lys Val Met Glu Ile Asn Arg Glu 
115 120 125 

Val Glu Glu Pro Pro Lys Lys Pro Ser Ala Phe Lys Pro Ala Ile Glu 
130 135 140 

Met Gln Asn Ser Val Pro Asn Lys Ala Phe Glu Leu Lys Asn Glu Gln 
145 150 155 160 

Thr Leu Arg Ala Asp Pro Met Phe Pro Pro Glu Ser Lys Gln Lys Asp 
165 170 175 

Tyr Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr Val Ser 
180 185 190 

Gln Lys Asp Val Cys Leu Pro Lys Ala Thr His Gln Lys Glu Ile Asp 
195 200 205 

Lys Ile Asn Gly Lys Leu Glu Glu Ser Pro Asn Lys Asp Gly Leu Leu 
210 215 220 

Lys Ala Thr Cys Gly Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu 
225 230 235 240 

Leu Lys Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Gly Lys Pro Ser 
245 250 255 

Ala Phe Glu Pro Ala Thr Glu Met Gln Lys Ser Val Pro Asn Lys Ala 
260 265 270 

Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Glu Ile Leu Pro 
275 280 285 

Ser Glu Ser Lys Gln Lys Asp Tyr Glu Glu Asn Ser Trp Asp Thr Glu 
290 295 300 

Ser Leu Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala 
305 310 315 320 

Ala His Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Gly Ser 
325 330 335 

Pro Gly Lys Asp Gly Leu Leu Lys Ala Asn Cys Gly Met Lys Val Ser 
340 345 350 

Ile Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gln Thr Phe Lys Ala 
355 360 365 

Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala Ile Glu Met Gln 
370 375 380 

Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu 
385 390 395 400 

Arg Ala Asp Glu Ile Leu Pro Ser Glu Ser Lys Gln Lys Asp Tyr Glu 
405 410 415 

Glu Ser Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr Val Ser Gln Lys 
420 425 430 

Asp Val Cys Leu Pro Lys Ala Ala His Gln Lys Glu Ile Asp Lys Ile 
435 440 445 

Asn Gly Lys Leu Glu Glu Ser Pro Asp Asn Asp Gly Phe Leu Lys Ser 
450 455 460 

Pro Cys Arg Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu Leu Met 
465 470 475 480 

Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe 
485 490 495 

Glu Pro Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu Glu 
500 505 510 

Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Gln Met Phe Pro Ser Glu 
515 520 525 

Ser Lys Gln Lys Asn Val Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu 
530 535 540 

Arg Glu Thr Val Ser Gln Lys Asp Val Cys Val Pro Lys Ala Thr His 
545 550 555 560 

Gln Lys Glu Met Asp Lys Ile Ser Gly Lys Leu Glu Asp Ser Thr Ser 
565 570 575 

Leu Ser Lys Ile Leu Asp Thr Val His Ser Cys Glu Arg Ala Arg Glu 
580 585 590 

Leu Gln Lys Asp His Cys Glu Gln Arg Thr Gly Lys Met Glu Gln Met 
595 600 605 

Lys Lys Lys Phe Cys Val Leu Lys Lys Lys Leu Ser Glu Ala Lys Glu 
610 615 620 

Ile Lys Ser Gln Leu Glu Asn Gln Lys Val Lys Trp Glu Gln Glu Leu 
625 630 635 640 

Cys Ser Val Arg Phe Leu Thr Leu Met Lys Met Lys Ile Ile Ser Tyr 
645 650 655 

Met Lys Ile Ala Cys 
660 

<210> 553 
<211> 1013 
<212> PRT 

<213> Homo sapiens 
<400> 553 

Met Gln His His His His His His Val Gly Ser Met Ser Pro Ala Lys 
5 10 15 

Glu Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys Gly Arg Pro Arg Lys 
20 25 30 

Ile Ala Trp Glu Lys Lys Glu Thr Pro Val Lys Thr Gly Cys Val Ala 
35 40 45 

Arg Val Thr Ser Asn Lys Thr Lys Val Leu Glu Lys Gly Arg Ser Lys 
50 55 60 

Met Ile Ala Cys Pro Thr Lys Glu Ser Ser Thr Lys Ala Ser Ala Asn 
65 70 75 80 

Asp Gln Arg Phe Pro Ser Glu Ser Lys Gln Glu Glu Asp Glu Glu Tyr 
85 90 95 

Ser Cys Asp Ser Arg Ser Leu Phe Glu Ser Ser Ala Lys Ile Gln Val 
100 105 110 

Cys Ile Pro Glu Ser Ile Tyr Gln Lys Val Met Glu Ile Asn Arg Glu 
115 120 125 

Val Glu Glu Pro Pro Lys Lys Pro Ser Ala Phe Lys Pro Ala Ile Glu 
130 135 140 

Met Gln Asn Ser Val Pro Asn Lys Ala Phe Glu Leu Lys Asn Glu Gln 
145 150 155 160 

Thr Leu Arg Ala Asp Pro Met Phe Pro Pro Glu Ser Lys Gln Lys Asp 
165 170 175 

Tyr Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr Val Ser 
180 185 190 

Gln Lys Asp Val Cys Leu Pro Lys Ala Thr His Gln Lys Glu Ile Asp 
195 200 205 

Lys Ile Asn Gly Lys Leu Glu Glu Ser Pro Asn Lys Asp Gly Leu Leu 
210 215 220 

Lys Ala Thr Cys Gly Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu 
225 230 235 240 

Leu Lys Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Gly Lys Pro Ser 
245 250 255 

Ala Phe Glu Pro Ala Thr Glu Met Gln Lys Ser Val Pro Asn Lys Ala 
260 265 270 

Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Glu Ile Leu Pro 
275 280 285 

Ser Glu Ser Lys Gln Lys Asp Tyr Glu Glu Asn Ser Trp Asp Thr Glu 
290 295 300 

Ser Leu Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala 
305 310 315 320 

Ala His Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Gly Ser 
325 330 335 

Pro Gly Lys Asp Gly Leu Leu Lys Ala Asn Cys Gly Met Lys Val Ser 
340 345 350 

Ile Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gln Thr Phe Lys Ala 
355 360 365 

Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala Ile Glu Met Gln 
370 375 380 

Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu 
385 390 395 400 

Arg Ala Asp Glu Ile Leu Pro Ser Glu Ser Lys Gln Lys Asp Tyr Glu 
405 410 415 

Glu Ser Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr Val Ser Gln Lys 
420 425 430 

Asp Val Cys Leu Pro Lys Ala Ala His Gln Lys Glu Ile Asp Lys Ile 
435 440 445 

Asn Gly Lys Leu Glu Glu Ser Pro Asp Asn Asp Gly Phe Leu Lys Ser 
450 455 460 

Pro Cys Arg Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu Leu Met 
465 470 475 480 

Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe 
485 490 495 

Glu Pro Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu Glu 
500 505 510 

Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Gln Met Phe Pro Ser Glu 
515 520 525 

Ser Lys Gln Lys Asn Val Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu 
530 535 540 

Arg Glu Thr Val Ser Gln Lys Asp Val Cys Val Pro Lys Ala Thr His 
545 550 555 560 

Gln Lys Glu Met Asp Lys Ile Ser Gly Lys Leu Glu Asp Ser Thr Ser 
565 570 575 

Leu Ser Lys Ile Leu Asp Thr Val His Ser Cys Glu Arg Ala Arg Glu 
580 585 590 

Leu Gln Lys Asp His Cys Glu Gln Arg Thr Gly Lys Met Glu Gln Met 
595 600 605 

Lys Lys Lys Phe Cys Val Leu Lys Lys Lys Leu Ser Glu Ala Lys Glu 
610 615 620 

Ile Lys Ser Gln Leu Glu Asn Gln Lys Val Lys Trp Glu Gln Glu Leu 
625 630 635 640 

Cys Ser Val Arg Leu Thr Leu Asn Gln Glu Glu Glu Lys Arg Arg Asn 
645 650 655 

Ala Asp Ile Leu Asn Glu Lys Ile Arg Glu Glu Leu Gly Arg Ile Glu 
660 665 670 

Glu Gln His Arg Lys Glu Leu Glu Val Lys Gln Gln Leu Glu Gln Ala 
675 680 685 

Leu Arg Ile Gln Asp Ile Glu Leu Lys Ser Val Glu Ser Asn Leu Asn 
690 695 700 

Gln Val Ser His Thr His Glu Asn Glu Asn Tyr Leu Leu His Glu Asn 
705 710 715 720 

Cys Met Leu Lys Lys Glu Ile Ala Met Leu Lys Leu Glu Ile Ala Thr 
725 730 735 

Leu Lys His Gln Tyr Gln Glu Lys Glu Asn Lys Tyr Phe Glu Asp Ile 
740 745 750 

Lys Ile Leu Lys Glu Lys Asn Ala Glu Leu Gln Met Thr Leu Lys Leu 
755 760 765 

Lys Glu Glu Ser Leu Thr Lys Arg Ala Ser Gln Tyr Ser Gly Gln Leu 
770 775 780 

Lys Val Leu Ile Ala Glu Asn Thr Met Leu Thr Ser Lys Leu Lys Glu 
785 790 795 800 

Lys Gln Asp Lys Glu Ile Leu Glu Ala Glu Ile Glu Ser His His Pro 
805 810 815 

Arg Leu Ala Ser Ala Val Gln Asp His Asp Gln Ile Val Thr Ser Arg 
820 825 830 

Lys Ser Gln Glu Pro Ala Phe His Ile Ala Gly Asp Ala Cys Leu Gln 
835 840 845 

Arg Lys Met Asn Val Asp Val Ser Ser Thr Ile Tyr Asn Asn Glu Val 
850 855 860 

Leu His Gln Pro Leu Ser Glu Ala Gln Arg Lys Ser Lys Ser Leu Lys 
865 870 875 880 

Ile Asn Leu Asn Tyr Ala Gly Asp Ala Leu Arg Glu Asn Thr Leu Val 
885 890 895 

Ser Glu His Ala Gln Arg Asp Gln Arg Glu Thr Gln Cys Gln Met Lys 
900 905 910 

Glu Ala Glu His Met Tyr Gln Asn Glu Gln Asp Asn Val Asn Lys His 
915 920 925 

Thr Glu Gln Gln Glu Ser Leu Asp Gln Lys Leu Phe Gln Leu Gln Ser 
930 935 940 

Lys Asn Met Trp Leu Gln Gln Gln Leu Val His Ala His Lys Lys Ala 
945 950 955 960 

Asp Asn Lys Ser Lys Ile Thr Ile Asp Ile His Phe Leu Glu Arg Lys 
965 970 975 

Met Gln His His Leu Leu Lys Glu Lys Asn Glu Glu Ile Phe Asn Tyr 
980 985 990 

Asn Asn His Leu Lys Asn Arg Ile Tyr Gln Tyr Glu Lys Glu Lys Ala 
995 1000 1005 

Glu Thr Glu Val Ile 
1010 



<210> 554 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 



<400> 554 

gtcggctcca tgagtcccgc aaaag 25 

<210> 555 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 



<400> 555 

cgagaattca atacttaaga agaccatctt taccag 36 

<210> 556 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR primer 

<400> 556 

cataagctta aggctaactg cggaatgaaa g 31 

<210> 557 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 557 

cccgcagaat tcaacatgca attttcatgt aagag 35 

<210> 558 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 558 

ctaaatgccg gcacaagagc tctgc 25 

<210> 559 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 559 

cgcgcagaat tctattatat aacttctgtt tctgc 35 

<210> 560 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 560 

ggggaattgt gagcggataa caattc 26 

<210> 561 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 561 

cgtagaattc aacctgattt aaattacttt ctacac 36 

<210> 562 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 



<400> 562 

gaaagtaatt taaatcaggt ttctcacact c 31 

<210> 563 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 



<400> 563 

gaggccccaa ggggttatgc tag 23 



<210> 564 

<211> 4458 

<212> DNA 

<213> Homo sapiens 

<400> 564 

ctagtctata ccagcaacga ctcctacatc gtccactctg gggatcttag aaagatccat 60 

aaagctgcct cccggggaca agtccggaag ctggagaaga tgacaaagag gaagaagacc 120 

atcaacctta atatacaaga cgcccagaag aggactgctc tacactgggc ctgtgtcaat 180 

ggccatgagg aagtagtaac atttctggta gacagaaagt gccagcttga cgtccttgat 240 

ggcgaacaca ggacacctct gatgaaggct ctacaatgcc atcaggaggc ttgtgcaaat 300 

attctgatag attctggtgc cgatataaat ctcgtagatg tgtatggcaa catggctctc 360 

cattatgctg tttatagtga gattttgtca gtggtggcaa aactgctgtc ccatggtgca 420 

gtcatcgaag tgcacaacaa ggctagcctc acaccacttt tactatccat aacgaaaaga 480 

agtgagcaaa ttgtggaatt tttgctgata aaaaatgcaa atgcgaatgc agttaataag 540 

tataaatgca cagccctcat gcttgctgta tgtcatggat catcagagat agttggcatg 600 

cttcttcagc aaaatgttga cgtctttgct gcagatatat gtggagtaac tgcagaacat 660 

tatgctgtta cttgtggatt tcatcacatt catgaacaaa ttatggaata tatacgaaaa 720 

ttatctaaaa atcatcaaaa taccaatcca gaaggaacat ctgcaggaac acctgatgag 780 

gctgcaccct tggcggaaag aacacctgac acagctgaaa gcttggtgga aaaaacacct 840 

gatgaggctg cacccttggt ggaaagaaca cctgacacgg ctgaaagctt ggtggaaaaa 900 

acacctgatg aggctgcatc cttggtggag ggaacatctg acaaaattca atgtttggag 960 

aaagcgacat ctggaaagtt cgaacagtca gcagaagaaa cacctaggga aattacgagt 1020 

cctgcaaaag aaacatctga gaaatttacg tggccagcaa aaggaagacc taggaagatc 1080 

gcatgggaga aaaaagaaga cacacctagg gaaattatga gtcccgcaaa agaaacatct 1140 

gagaaattta cgtgggcagc aaaaggaaga cctaggaaga tcgcatggga gaaaaaagaa 1200 






acacctgtaa agactggatg cgtggcaaga gtaacatcta ataaaactaa agttttggaa 1260
aaaggaagat ctaagatgat tgcatgtcct acaaaagaat catctacaaa agcaagtgcc 1320
aatgatcaga ggttcccatc agaatccaaa caagaggaag atgaagaata ttcttgtgat 1380
tctcggagtc tctttgagag ttctgcaaag attcaagtgt gtatacctga gtctatatat 1440
caaaaagtaa tggagataaa tagagaagta gaagagcctc ctaagaagcc atctgccttc 1500
aagcctgcca ttgaaatgca aaactctgtt ccaaataaag cctttgaatt gaagaatgaa 1560
caaacattga gagcagatcc gatgttccca ccagaatcca aacaaaagga ctatgaagaa 1620
aattcttggg attctgagag tctctgtgag actgtttcac agaaggatgt gtgtttaccc 1680
aaggctacac atcaaaaaga aatagataaa ataaatggaa aattagaaga gtctcctaat 1740
aaagatggtc ttctgaaggc tacctgcgga atgaaagttt ctattccaac taaagcctta 1800
gaattgaagg acatgcaaac tttcaaagcg gagcctccgg ggaagccatc tgccttcgag 1860
cctgccactg aaatgcaaaa gtctgtccca aataaagcct tggaattgaa aaatgaacaa 1920
acatggagag cagatgagat actcccatca gaatccaaac aaaaggacta tgaagaaaat 1980
tcttgggata ctgagagtct ctgtgagact gtttcacaga aggatgtgtg tttacccaag 2040
gctgcgcatc aaaaagaaat agataaaata aatggaaaat tagaagggtc tcctgttaaa 2100
gatggtcttc tgaaggctaa ctgcggaatg aaagtttcta ttccaactaa agccttagaa 2160
ttgatggaca tgcaaacttt caaagcagag cctcccgaga agccatctgc cttcgagcct 2220
gccattgaaa tgcaaaagtc tgttccaaat aaagccttgg aattgaagaa tgaacaaaca 2280
ttgagagcag atgagatact cccatcagaa tccaaacaaa aggactatga agaaagttct 2340
tgggattctg agagtctctg tgagactgtt tcacagaagg atgtgtgttt acccaaggct 2400
acacatcaaa aagaaataga taaaataaat ggaaaattag aagagtctcc tgataatgat 2460
ggttttctga aggctccctg cagaatgaaa gtttctattc caactaaagc cttagaattg 2520
atggacatgc aaactttcaa agcagagcct cccgagaagc catctgcctt cgagcctgcc 2580
attgaaatgc aaaagtctgt tccaaataaa gccttggaat tgaagaatga acaaacattg 2640
agagcagatc agatgttccc ttcagaatca aaacaaaaga aggttgaaga aaattcttgg 2700
gattctgaga gtctccgtga gactgtttca cagaaggatg tgtgtgtacc caaggctaca 2760
catcaaaaag aaatggataa aataagtgga aaattagaag attcaactag cctatcaaaa 2820
atcttggata cagttcattc ttgtgaaaga gcaagggaac ttcaaaaaga tcactgtgaa 2880
caacgtacag gaaaaatgga acaaatgaaa aagaagtttt gtgtactgaa aaagaaactg 2940
tcagaagcaa aagaaataaa atcacagtta gagaaccaaa aagttaaatg ggaacaagag 3000
ctctgcagtg tgagattgac tttaaaccaa gaagaagaga agagaagaaa tgccgatata 3060
ttaaatgaaa aaattaggga agaattagga agaatcgaag agcagcatag gaaagagtta 3120
gaagtgaaac aacaacttga acaggctctc agaatacaag atatagaatt gaagagtgta 3180
gaaagtaatt tgaatcaggt ttctcacact catgaaaatg aaaattatct cttacatgaa 3240
aattgcatgt tgaaaaagga aattgccatg ctaaaactgg aaatagccac actgaaacac 3300
caataccagg aaaaggaaaa taaatacttt gaggacatta agattttaaa agaaaagaat 3360
gctgaacttc agatgaccct aaaactgaaa gaggaatcat taactaaaag ggcatctcaa 3420
tatagtgggc agcttaaagt tctgatagct gagaacacaa tgctcacttc taaattgaag 3480
gaaaaacaag acaaagaaat actagaggca gaaattgaat cacaccatcc tagactggct 3540
tctgctgtac aagaccatga tcaaattgtg acatcaagaa aaagtcaaga acctgctttc 3600
cacattgcag gagatgcttg tttgcaaaga aaaatgaatg ttgatgtgag tagtacgata 3660
tataacaatg aggtgctcca tcaaccactt tctgaagctc aaaggaaatc caaaagccta 3720
aaaattaatc tcaattatgc aggagatgct ctaagagaaa atacattggt ttcagaacat 3780
gcacaaagag accaacgtga aacacagtgt caaatgaagg aagctgaaca catgtatcaa 3840
aacgaacaag ataatgtgaa caaacacact gaacagcagg agtctctaga tcagaaatta 3900
tttcaactac aaagcaaaaa tatgtggctt caacagcaat tagttcatgc acataagaaa 3960
gctgacaaca aaagcaagat aacaattgat attcattttc ttgagaggaa aatgcaacat 4020
catctcctaa aagagaaaaa tgaggagata tttaattaca ataaccattt aaaaaaccgt 4080
atatatcaat atgaaaaaga gaaagcagaa acagaaaact catgagagac aagcagtaag 4140
aaacttcttt tggagaaaca acagaccaga tctttactca caactcatgc taggaggcca 4200
gtcctagcat caccttatgt tgaaaatctt accaatagtc tgtgtcaaca gaatacttat 4260
tttagaagaa aaattcatga tttcttcctg aagcctacag acataaaata acagtgtgaa 4320
gaattacttg ttcacgaatt gcataaagct gcacaggatt cccatctacc ctgatgatgc 4380
agcagacatc attcaatcca accagaatct cgctctgcac tccagcctag gtgacagagt 4440
gagactccac ctcggaaa 4458



<210> 565 
<211> 1341 
<212> PRT

<213> Homo sapiens 
<400> 565 

Met Thr Lys Arg Lys Lys Thr Ile Asn Leu Asn Ile Gln Asp Ala Gln
5 10 15

Lys Arg Thr Ala Leu His Trp Ala Cys Val Asn Gly His Glu Glu Val
20 25 30

Val Thr Phe Leu Val Asp Arg Lys Cys Gln Leu Asp Val Leu Asp Gly
35 40 45

Glu His Arg Thr Pro Leu Met Lys Ala Leu Gln Cys His Gln Glu Ala
50 55 60

Cys Ala Asn Ile Leu Ile Asp Ser Gly Ala Asp Ile Asn Leu Val Asp
65 70 75 80

Val Tyr Gly Asn Met Ala Leu His Tyr Ala Val Tyr Ser Glu Ile Leu
85 90 95

Ser Val Val Ala Lys Leu Leu Ser His Gly Ala Val Ile Glu Val His
100 105 110

Asn Lys Ala Ser Leu Thr Pro Leu Leu Leu Ser Ile Thr Lys Arg Ser
115 120 125

Glu Gln Ile Val Glu Phe Leu Leu Ile Lys Asn Ala Asn Ala Asn Ala
130 135 140

Val Asn Lys Tyr Lys Cys Thr Ala Leu Met Leu Ala Val Cys His Gly
145 150 155 160

Ser Ser Glu Ile Val Gly Met Leu Leu Gln Gln Asn Val Asp Val Phe
165 170 175

Ala Ala Asp Ile Cys Gly Val Thr Ala Glu His Tyr Ala Val Thr Cys
180 185 190

Gly Phe His His Ile His Glu Gln Ile Met Glu Tyr Ile Arg Lys Leu
195 200 205

Ser Lys Asn His Gln Asn Thr Asn Pro Glu Gly Thr Ser Ala Gly Thr
210 215 220

Pro Asp Glu Ala Ala Pro Leu Ala Glu Arg Thr Pro Asp Thr Ala Glu
225 230 235 240

Ser Leu Val Glu Lys Thr Pro Asp Glu Ala Ala Pro Leu Val Glu Arg
245 250 255

Thr Pro Asp Thr Ala Glu Ser Leu Val Glu Lys Thr Pro Asp Glu Ala
260 265 270

Ala Ser Leu Val Glu Gly Thr Ser Asp Lys Ile Gln Cys Leu Glu Lys
275 280 285
275 280 285 
Ala Thr Ser Gly Lys Phe Glu Gln Ser Ala Glu Glu Thr Pro Arg Glu
290 295 300

Ile Thr Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Pro Ala
305 310 315 320

Lys Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Asp Thr Pro
325 330 335

Arg Glu Ile Met Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp
340 345 350

Ala Ala Lys Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Thr
355 360 365

Pro Val Lys Thr Gly Cys Val Ala Arg Val Thr Ser Asn Lys Thr Lys
370 375 380

Val Leu Glu Lys Gly Arg Ser Lys Met Ile Ala Cys Pro Thr Lys Glu
385 390 395 400

Ser Ser Thr Lys Ala Ser Ala Asn Asp Gln Arg Phe Pro Ser Glu Ser
405 410 415

Lys Gln Glu Glu Asp Glu Glu Tyr Ser Cys Asp Ser Arg Ser Leu Phe
420 425 430

Glu Ser Ser Ala Lys Ile Gln Val Cys Ile Pro Glu Ser Ile Tyr Gln
435 440 445

Lys Val Met Glu Ile Asn Arg Glu Val Glu Glu Pro Pro Lys Lys Pro
450 455 460

Ser Ala Phe Lys Pro Ala Ile Glu Met Gln Asn Ser Val Pro Asn Lys
465 470 475 480

Ala Phe Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Pro Met Phe
485 490 495

Pro Pro Glu Ser Lys Gln Lys Asp Tyr Glu Glu Asn Ser Trp Asp Ser
500 505 510

Glu Ser Leu Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys
515 520 525

Ala Thr His Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Glu
530 535 540

Ser Pro Asn Lys Asp Gly Leu Leu Lys Ala Thr Cys Gly Met Lys Val
545 550 555 560

Ser Ile Pro Thr Lys Ala Leu Glu Leu Lys Asp Met Gln Thr Phe Lys
565 570 575

Ala Glu Pro Pro Gly Lys Pro Ser Ala Phe Glu Pro Ala Thr Glu Met
580 585 590

Gln Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr
595 600 605



Trp Arg Ala Asp Glu Ile Leu Pro Ser Glu Ser Lys Gln Lys Asp Tyr
610 615 620

Glu Glu Asn Ser Trp Asp Thr Glu Ser Leu Cys Glu Thr Val Ser Gln
625 630 635 640

Lys Asp Val Cys Leu Pro Lys Ala Ala His Gln Lys Glu Ile Asp Lys
645 650 655

Ile Asn Gly Lys Leu Glu Gly Ser Pro Val Lys Asp Gly Leu Leu Lys
660 665 670

Ala Asn Cys Gly Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu Leu
675 680 685

Met Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala
690 695 700

Phe Glu Pro Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu
705 710 715 720

Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Glu Ile Leu Pro Ser
725 730 735

Glu Ser Lys Gln Lys Asp Tyr Glu Glu Ser Ser Trp Asp Ser Glu Ser
740 745 750

Leu Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala Thr
755 760 765

His Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Glu Ser Pro
770 775 780

Asp Asn Asp Gly Phe Leu Lys Ala Pro Cys Arg Met Lys Val Ser Ile
785 790 795 800

Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gln Thr Phe Lys Ala Glu
805 810 815

Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala Ile Glu Met Gln Lys
820 825 830

Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg
835 840 845

Ala Asp Gln Met Phe Pro Ser Glu Ser Lys Gln Lys Lys Val Glu Glu
850 855 860

Asn Ser Trp Asp Ser Glu Ser Leu Arg Glu Thr Val Ser Gln Lys Asp
865 870 875 880

Val Cys Val Pro Lys Ala Thr His Gln Lys Glu Met Asp Lys Ile Ser
885 890 895

Gly Lys Leu Glu Asp Ser Thr Ser Leu Ser Lys Ile Leu Asp Thr Val
900 905 910
His Ser Cys Glu Arg Ala Arg Glu Leu Gln Lys Asp His Cys Glu Gln
915 920 925

Arg Thr Gly Lys Met Glu Gln Met Lys Lys Lys Phe Cys Val Leu Lys
930 935 940

Lys Lys Leu Ser Glu Ala Lys Glu Ile Lys Ser Gln Leu Glu Asn Gln
945 950 955 960

Lys Val Lys Trp Glu Gln Glu Leu Cys Ser Val Arg Leu Thr Leu Asn
965 970 975

Gln Glu Glu Glu Lys Arg Arg Asn Ala Asp Ile Leu Asn Glu Lys Ile
980 985 990

Arg Glu Glu Leu Gly Arg Ile Glu Glu Gln His Arg Lys Glu Leu Glu
995 1000 1005

Val Lys Gln Gln Leu Glu Gln Ala Leu Arg Ile Gln Asp Ile Glu Leu
1010 1015 1020

Lys Ser Val Glu Ser Asn Leu Asn Gln Val Ser His Thr His Glu Asn
1025 1030 1035 1040

Glu Asn Tyr Leu Leu His Glu Asn Cys Met Leu Lys Lys Glu Ile Ala
1045 1050 1055

Met Leu Lys Leu Glu Ile Ala Thr Leu Lys His Gln Tyr Gln Glu Lys
1060 1065 1070

Glu Asn Lys Tyr Phe Glu Asp Ile Lys Ile Leu Lys Glu Lys Asn Ala
1075 1080 1085

Glu Leu Gln Met Thr Leu Lys Leu Lys Glu Glu Ser Leu Thr Lys Arg
1090 1095 1100

Ala Ser Gln Tyr Ser Gly Gln Leu Lys Val Leu Ile Ala Glu Asn Thr
1105 1110 1115 1120

Met Leu Thr Ser Lys Leu Lys Glu Lys Gln Asp Lys Glu Ile Leu Glu
1125 1130 1135

Ala Glu Ile Glu Ser His His Pro Arg Leu Ala Ser Ala Val Gln Asp
1140 1145 1150

His Asp Gln Ile Val Thr Ser Arg Lys Ser Gln Glu Pro Ala Phe His
1155 1160 1165

Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn Val Asp Val Ser
1170 1175 1180

Ser Thr Ile Tyr Asn Asn Glu Val Leu His Gln Pro Leu Ser Glu Ala
1185 1190 1195 1200

Gln Arg Lys Ser Lys Ser Leu Lys Ile Asn Leu Asn Tyr Ala Gly Asp
1205 1210 1215
Ala Leu Arg Glu Asn Thr Leu Val Ser Glu His Ala Gln Arg Asp Gln
1220 1225 1230

Arg Glu Thr Gln Cys Gln Met Lys Glu Ala Glu His Met Tyr Gln Asn
1235 1240 1245

Glu Gln Asp Asn Val Asn Lys His Thr Glu Gln Gln Glu Ser Leu Asp
1250 1255 1260

Gln Lys Leu Phe Gln Leu Gln Ser Lys Asn Met Trp Leu Gln Gln Gln
1265 1270 1275 1280

Leu Val His Ala His Lys Lys Ala Asp Asn Lys Ser Lys Ile Thr Ile
1285 1290 1295

Asp Ile His Phe Leu Glu Arg Lys Met Gln His His Leu Leu Lys Glu
1300 1305 1310

Lys Asn Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile
1315 1320 1325

Tyr Gln Tyr Glu Lys Glu Lys Ala Glu Thr Glu Asn Ser
1330 1335 1340



<210> 566 

<211> 4047 

<212> DNA 

<213> Homo sapiens 

<400> 566 

atgcagcatc accaccatca ccaccacaca aagaggaaga agaccatcaa ccttaatata 60
caagacgccc agaagaggac tgctctacac tgggcctgtg tcaatggcca tgaggaagta 120
gtaacatttc tggtagacag aaagtgccag cctgacgtcc ttgatggcga acacaggaca 180
cctctgatga aggctctaca atgccatcag gaggcttgtg caaatattct gatagattct 240
ggtgccgata taaatctcgt agatgtgtat ggcaacatgg ctctccatta tgctgtttat 300
agtgagattt tgtcagtggt ggcaaaactg ctgtcccatg gtgcagtcat cgaagtgcac 360
aacaaggcta gcctcacacc acttttacta tccataacga aaagaagtga gcaaattgtg 420
gaatttttgc tgataaaaaa tgcaaatgcg aatgcagtta ataagtataa atgcacagcc 480
ctcatgcttg ctgtatgtca tggattatca gagatagttg gcatgcttct tcagcaaaat 540
gttgacgtct ttgctgcaga tatatgtgga gtaactgcag aacattatgc tgttacttgt 600
ggatttcatc acattcatga acaaattatg gaatatatac gaaaattatc taaaaatcat 660
caaaatacca atccagaagg aacatctgca ggaacacctg atgaggctgc acccttggcg 720
gaaagaacac ctgacacagc tgaaagcttg gtggaaaaaa cacctgatga ggctgcaccc 780
ttggtggaaa gaacacctga cacggctgaa agcttggtgg aaaaaacacc tgatgaggct 840
gcatccttgg tggagggaac atctgacaaa attcaatgtt tggagaaagc gacatctgga 900
aagttcgaac agtcagcaga agaaacacct agggaaatta cgagtcctgc aaaagaaaca 960
tctgagaaat ttacgtggcc agcaaaagga agacctagga agatcgcatg ggagaaaaaa 1020
gaagacacac ctagggaaat tatgagtccc gcaaaagaaa catctgagaa atttacgtgg 1080
gcagcaaaag gaagacctag gaagatcgca tgggagaaaa aagaaacacc tgtaaagact 1140
ggatgcgtgg caagagtaac atctaataaa actaaagttt tggaaaaagg aagatctaag 1200
atgattgcat gtcctacaaa agaatcatct acaaaagcaa gtgccaatga tcagaggttc 1260
ccatcagaat ccaaacaaga ggaagatgaa gaatattctt gtgattctcg gagtctcttt 1320
gagagttctg caaagattca agtgtgtata cctgagtcta tatatcaaaa agtaatggag 1380
ataaatagag aagtagaaga gcctcctaag aagccatctg ccttcaagcc tgccattgaa 1440
atgcaaaact ctgttccaaa taaagccttt gaattgaaga atgaacaaac attgagagca 1500
gatccgatgt tcccaccaga atccaaacaa aaggactatg aagaaaattc ttgggattct 1560
gagagtctct gtgagactgt ttcacagaag gatgtgtgtt tacccaaggc tacacatcaa 1620
aaagaaatag ataaaataaa tggaaaatta gaagagtctc ctaataaaga tggtcttctg 1680 
aaggctacct gcggaatgaa agtttctatt ccaactaaag ccttagaatt gaaggacatg 1740 
caaactttca aagcggagcc tccggggaag ccatctgcct tcgagcctgc cactgaaatg 1800 
caaaagtctg tcccaaataa agccttggaa ttgaaaaatg aacaaacatg gagagcagat 1860 
gagatactcc catcagaatc caaacaaaag gactatgaag aaaattcttg ggatactgag 1920 
agtctctgtg agactgtttc acagaaggat gtgtgtttac ccaaggctgc gcatcaaaaa 1980 
gaaatagata aaataaatgg aaaattagaa gggtctcctg ttaaagatgg tcttctgaag 2040 
gctaactgcg gaatgaaagt ttctattcca actaaagcct tagaattgat ggacatgcaa 2100 
actttcaaag cagagcctcc cgagaagcca tctgccttcg agcctgccat tgaaatgcaa 2160 
aagtctgttc caaataaagc cttggaattg aagaatgaac aaacattgag agcagatgag 2220 
atactcccat cagaatccaa acaaaaggac tatgaagaaa gttcttggga ttctgagagt 2280 
ctctgtgaga ctgtttcaca gaaggatgtg tgtttaccca aggctacaca tcaaaaagaa 2340 
atagataaaa taaatggaaa attagaagag tctcctgata atgatggttt tctgaaggct 2400 
ccctgcagaa tgaaagtttc tattccaact aaagccttag aattgatgga catgcaaact 2460 
ttcaaagcag agcctcccga gaagccatct gccttcgagc ctgccattga aatgcaaaag 2520 
tctgttccaa ataaagcctt ggaattgaag aatgaacaaa cattgagagc agatcagatg 2580 
ttcccttcag aatcaaaaca aaagaaggtt gaagaaaatt cttgggattc tgagagtctc 2640 
cgtgagactg tttcacagaa ggatgtgtgt gtacccaagg ctacacatca aaaagaaatg 2700 
gataaaataa gtggaaaatt agaagattca actagcctat caaaaatctt ggatacagtt 2760 
cattcttgtg aaagagcaag ggaacttcaa aaagatcact gtgaacaacg tacaggaaaa 2820 
atggaacaaa tgaaaaagaa gttttgtgta ctgaaaaaga aactgtcaga agcaaaagaa 2880 
ataaaatcac agttagagaa ccaaaaagtt aaatgggaac aagagctctg cagtgtgaga 2940 
ttgactttaa accaagaaga agagaagaga agaaatgccg atatattaaa tgaaaaaatt 3000 
agggaagaat taggaagaat cgaagagcag cataggaaag agttagaagt gaaacaacaa 3060 
cttgaacagg ctctcagaat acaagatata gaattgaaga gtgtagaaag taatttgaat 3120 
caggtttctc acactcatga aaatgaaaat tatctcttac atgaaaattg catgttgaaa 3180 
aaggaaattg ccatgctaaa actggaaata gccacactga aacaccaata ccaggaaaag 3240 
gaaaataaat actttgagga cattaagatt ttaaaagaaa agaatgctga acttcagatg 3300 
accctaaaac tgaaagagga atcattaact aaaagggcat ctcaatatag tgggcagctt 3360 
aaagttctga tagctgagaa cacaatgctc acttctaaat tgaaggaaaa acaagacaaa 3420 
gaaatactag aggcagaaat tgaatcacac catcctagac tggcttctgc tgtacaagac 3480 
catgatcaaa ttgtgacatc aagaaaaagt caagaacctg ctttccacat tgcaggagat 3540 
gcttgtttgc aaagaaaaat gaatgttgat gtgagtagta cgatatataa caatgaggtg 3600 
ctccatcaac cactttctga agctcaaagg aaatccaaaa gcctaaaaat taatctcaat 3660 
tatgcaggag atgctctaag agaaaataca ttggtttcag aacatgcaca aagagaccaa 3720 
cgtgaaacac agtgtcaaat gaaggaagct gaacacatgt atcaaaacga acaagataat 3780 
gtgaacaaac acactgaaca gcaggagtct ctagatcaga aattatttca actacaaagc 3840 
aaaaatatgt ggcttcaaca gcaattagtt catgcacata agaaagctga caacaaaagc 3900 
aagataacaa ttgatattca ttttcttgag aggaaaatgc aacatcatct cctaaaagag 3960 
aaaaatgagg agatatttaa ttacaataac catttaaaaa accgtatata tcaatatgaa 4020 
aaagagaaag cagaaacaga agttata 4047 

<210> 567 
<211> 1199 
<212> DNA 

<213> Homo sapiens 



<400> 567 

acaaagagga agaagaccat caaccttaat atacaagacg cccagaagag gactgctcta 60
cactgggcct gtgtcaatgg ccatgaggaa gtagtaacat ttctggtaga cagaaagtgc 120
cagcctgacg tccttgatgg cgaacacagg acacctctga tgaaggctct acaatgccat 180
caggaggctt gtgcaaatat tctgatagat tctggtgccg atataaatct cgtagatgtg 240
tatggcaaca tggctctcca ttatgctgtt tatagtgaga ttttgtcagt ggtggcaaaa 300
ctgctgtccc atggtgcagt catcgaagtg cacaacaagg ctagcctcac accactttta 360
ctatccataa cgaaaagaag tgagcaaatt gtggaatttt tgctgataaa aaatgcaaat 420
gcgaatgcag ttaataagta taaatgcaca gccctcatgc ttgctgtatg tcatggatta 480
tcagagatag ttggcatgct tcttcagcaa aatgttgacg tctttgctgc agatatatgt 540
ggagtaactg cagaacatta tgctgttact tgtggatttc atcacattca tgaacaaatt 600

atggaatata tacgaaaatt atctaaaaat catcaaaata ccaatccaga aggaacatct 660 

gcaggaacac ctgatgaggc tgcacccttg gcggaaagaa cacctgacac agctgaaagc 720 

ttggtggaaa aaacacctga tgaggctgca cccttggtgg aaagaacacc tgacacggct 780 

gaaagcttgg tggaaaaaac acctgatgag gctgcatcct tggtggaggg aacatctgac 840 

aaaattcaat gtttggagaa agcgacatct ggaaagttcg aacagtcagc agaagaaaca 900 

cctagggaaa ttacgagtcc tgcaaaagaa acatctgaga aatttacgtg gccagcaaaa 960 

ggaagaccta ggaagatcgc atgggagaaa aaagaagaca cacctaggga aattatgagt 1020 

cccgcaaaag aaacatctga gaaatttacg tgggcagcaa aaggaagacc taggaagatc 1080 

gcatgggaga aaaaagaaac acctgtaaag actggatgcg tggcaagagt aacatctaat 1140 

aaaactaaag ttttggaaaa aggaagatct aagatgattg catgtcctac aaaagaatc 1199 

<210> 568 

<211> 1199 

<212> DNA 

<213> Homo sapiens 

<400> 568 

acaaagagga agaagaccat caaccttaat atacaagacg cccagaagag gactgctcta 60 
cactgggcct gtgtcaatgg ccatgaggaa gtagtaacat ttctggtaga cagaaagtgc 120 
cagcttgacg tccttgatgg cgaacacagg acacctctga tgaaggctct acaatgccat 180 
caggaggctt gtgcaaatat tctgatagat tctggtgccg atataaatct cgtagatgtg 240 
tatgacaaca cggctctcca ttatgctgtt tatagtgaga ttttgtcagt ggtggcaaaa 300 
ctgctgtccc atggtgcagt catctaagcg cacaacaagg ctagcctcac accactttta 360 
ctatccataa cgaaaagaag tgagcaaatt gtggaatttt tgctgataaa aaatgcaaat 420 
gcgaatgcag ttaataagta taaatgcaca gccctcatgc ttgctgtatg tcatggatca 480 
tcagagatag ttggcatgct tcttcagcaa aatgttgacg tctttgctgc agatatatgt 540 
ggagtaactg cagaacatta tgctgttact tgtggatttc atcacattca tgaacaaatt 600 
atggaatata tacgaaaatt atctaaaaat catcaaaata ccaatccaga aggaacatct 660 
gcaggaacac ctgatgaggc tgcacccttg gcggaaagaa cacctgacac agctgaaagc 720 
ttggtggaaa aaacacctga tgaggctgca cccttggtgg aaagaacacc tgacacggct 780 
gaaagcttgg tggaaaaaac acctgatgag gctgcatcct tggtggaggg aacatctgac 840 
aaaattcaat gtttggagaa agcgacatct ggaaagttcg aacagtcagc agaagaaaca 900 
cctagggaaa ttacgagtcc tgcaaaagaa acatctgaga aatttacgtg gccagcaaaa 960 
ggaagaccta ggaagatcgc atgggagaaa aaagaagaca cacctaggga aattatgagt 1020 
cccgcaaaag aaacatctga gaaatttacg tgggcagcaa aaggaagacc taggaagatc 1080 
gcatgggaga aaaaagaaac acctgtaaag actggatgcg tggcaagagt aacatctaat 1140 
aaaactaaag ttttggaaaa aggaagatct aagatgattg catgtcctac aaaagaatc 1199 

<210> 569 
<211> 1199 
<212> DNA 
<213> Homo sapiens 

<400> 569 

acaaagagga agaagaccat caaccttaat atacaagacg cccagaagag gactgctcta 60
cactgggcct gtgtcaatgg ccatgaggaa gtagtaacat ttctggtaga cagaaagtgc 120
cagcttgacg tccttgatgg cgaacacagg acacctctga tgaaggctct acaatgccat 180
caggaggctt gtgcaaatat tctgatagat tctggtgccg atataaatct cgtagatgtg 240
tatggcaaca tggctctcca ttatgctgtt tatagtgaga ttttgtcagt ggtggcaaaa 300
ctgctgtccc atggtgcagt catcgaagtg cacaacaagg ctagcctcac accactttta 360
ctatccataa cgaaaagaag tgagcaaatt gtggaatttt tgctgataaa aaatgcaaat 420
gcgaatgcag ttaataagta taaatgcaca gccctcatgc ttgctgtatg tcatggatca 480
tcagagatag ttggcatgct tcttcagcaa aatgttgacg tctttgctgc agatatatgt 540
ggagtaactg cagaacatta tgctgttact tgtggatttc atcacattca tgaacaaatt 600
atggaatata tacgaaaatt atctaaaaat catcaaaata ccaatccaga aggaacatct 660
gcaggaacac ctgatgaggc tgcacccttg gcggaaagaa cacctgacac agctgaaagc 720
ttggtggaaa aaacacctga tgaggctgca cccttggtgg aaagaacacc tgacacggct 780
gaaagcttgg tggaaaaaac acctgatgag gctgcatcct tggtggaggg aacatctgac 840
aaaattcaat gtttggagaa agcgacatct ggaaagttcg aacagtcagc agaagaaaca 900 
cctagggaaa ttacgagtcc tgcaaaagaa acatctgaga aatttacgtg gccagcaaaa 960
ggaagaccta ggaagatcgc atgggagaaa aaagaagaca cacctaggga aattatgagt 1020 
cccgcaaaag aaacatctga gaaatttacg tgggcagcaa aaggaagacc taggaagatc 1080 
gcatgggaga aaaaagaaac acctgtaaag actggatgcg tggcaagagt aacatctaat 1140 
aaaactaaag ttttggaaaa aggaagatct aagatgattg catgtcctac aaaagaatc 1199 

<210> 570 
<211> 399 
<212> PRT 
<213> Homo sapiens 

<400> 570 

Thr Lys Arg Lys Lys Thr Ile Asn Leu Asn Ile Gln Asp Ala Gln Lys
5 10 15

Arg Thr Ala Leu His Trp Ala Cys Val Asn Gly His Glu Glu Val Val
20 25 30

Thr Phe Leu Val Asp Arg Lys Cys Gln Leu Asp Val Leu Asp Gly Glu
35 40 45

His Arg Thr Pro Leu Met Lys Ala Leu Gln Cys His Gln Glu Ala Cys
50 55 60

Ala Asn Ile Leu Ile Asp Ser Gly Ala Asp Ile Asn Leu Val Asp Val
65 70 75 80

Tyr Gly Asn Met Ala Leu His Tyr Ala Val Tyr Ser Glu Ile Leu Ser
85 90 95

Val Val Ala Lys Leu Leu Ser His Gly Ala Val Ile Glu Val His Asn
100 105 110

Lys Ala Ser Leu Thr Pro Leu Leu Leu Ser Ile Thr Lys Arg Ser Glu
115 120 125

Gln Ile Val Glu Phe Leu Leu Ile Lys Asn Ala Asn Ala Asn Ala Val
130 135 140

Asn Lys Tyr Lys Cys Thr Ala Leu Met Leu Ala Val Cys His Gly Ser
145 150 155 160

Ser Glu Ile Val Gly Met Leu Leu Gln Gln Asn Val Asp Val Phe Ala
165 170 175

Ala Asp Ile Cys Gly Val Thr Ala Glu His Tyr Ala Val Thr Cys Gly
180 185 190

Phe His His Ile His Glu Gln Ile Met Glu Tyr Ile Arg Lys Leu Ser
195 200 205

Lys Asn His Gln Asn Thr Asn Pro Glu Gly Thr Ser Ala Gly Thr Pro
210 215 220

Asp Glu Ala Ala Pro Leu Ala Glu Arg Thr Pro Asp Thr Ala Glu Ser
225 230 235 240
Leu Val Glu Lys Thr Pro Asp Glu Ala Ala Pro Leu Val Glu Arg Thr
245 250 255

Pro Asp Thr Ala Glu Ser Leu Val Glu Lys Thr Pro Asp Glu Ala Ala
260 265 270

Ser Leu Val Glu Gly Thr Ser Asp Lys Ile Gln Cys Leu Glu Lys Ala
275 280 285

Thr Ser Gly Lys Phe Glu Gln Ser Ala Glu Glu Thr Pro Arg Glu Ile
290 295 300

Thr Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Pro Ala Lys
305 310 315 320

Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Asp Thr Pro Arg
325 330 335

Glu Ile Met Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Ala
340 345 350

Ala Lys Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Thr Pro
355 360 365

Val Lys Thr Gly Cys Val Ala Arg Val Thr Ser Asn Lys Thr Lys Val
370 375 380

Leu Glu Lys Gly Arg Ser Lys Met Ile Ala Cys Pro Thr Lys Glu
385 390 395



<210> 571 
<211> 247 
<212> PRT 
<213> Homo sapiens 

<400> 571 

Met Leu Ala Val Cys His Gly Ser Ser Glu Ile Val Gly Met Leu Leu
5 10 15

Gln Gln Asn Val Asp Val Phe Ala Ala Asp Ile Cys Gly Val Thr Ala
20 25 30

Glu His Tyr Ala Val Thr Cys Gly Phe His His Ile His Glu Gln Ile
35 40 45

Met Glu Tyr Ile Arg Lys Leu Ser Lys Asn His Gln Asn Thr Asn Pro
50 55 60

Glu Gly Thr Ser Ala Gly Thr Pro Asp Glu Ala Ala Pro Leu Ala Glu
65 70 75 80

Arg Thr Pro Asp Thr Ala Glu Ser Leu Val Glu Lys Thr Pro Asp Glu
85 90 95

Ala Ala Pro Leu Val Glu Arg Thr Pro Asp Thr Ala Glu Ser Leu Val
100 105 110

Glu Lys Thr Pro Asp Glu Ala Ala Ser Leu Val Glu Gly Thr Ser Asp
115 120 125

Lys Ile Gln Cys Leu Glu Lys Ala Thr Ser Gly Lys Phe Glu Gln Ser
130 135 140

Ala Glu Glu Thr Pro Arg Glu Ile Thr Ser Pro Ala Lys Glu Thr Ser
145 150 155 160

Glu Lys Phe Thr Trp Pro Ala Lys Gly Arg Pro Arg Lys Ile Ala Trp
165 170 175

Glu Lys Lys Glu Asp Thr Pro Arg Glu Ile Met Ser Pro Ala Lys Glu
180 185 190

Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys Gly Arg Pro Arg Lys Ile
195 200 205

Ala Trp Glu Lys Lys Glu Thr Pro Val Lys Thr Gly Cys Val Ala Arg
210 215 220

Val Thr Ser Asn Lys Thr Lys Val Leu Glu Lys Gly Arg Ser Lys Met
225 230 235 240

Ile Ala Cys Pro Thr Lys Glu
245



<210> 572 
<211> 399 
<212> PRT 
<213> Homo sapiens 

<400> 572 

Thr Lys Arg Lys Lys Thr Ile Asn Leu Asn Ile Gln Asp Ala Gln Lys
5 10 15

Arg Thr Ala Leu His Trp Ala Cys Val Asn Gly His Glu Glu Val Val
20 25 30

Thr Phe Leu Val Asp Arg Lys Cys Gln Pro Asp Val Leu Asp Gly Glu
35 40 45

His Arg Thr Pro Leu Met Lys Ala Leu Gln Cys His Gln Glu Ala Cys
50 55 60

Ala Asn Ile Leu Ile Asp Ser Gly Ala Asp Ile Asn Leu Val Asp Val
65 70 75 80

Tyr Gly Asn Met Ala Leu His Tyr Ala Val Tyr Ser Glu Ile Leu Ser
85 90 95

Val Val Ala Lys Leu Leu Ser His Gly Ala Val Ile Glu Val His Asn
100 105 110

Lys Ala Ser Leu Thr Pro Leu Leu Leu Ser Ile Thr Lys Arg Ser Glu
115 120 125
Gln Ile Val Glu Phe Leu Leu Ile Lys Asn Ala Asn Ala Asn Ala Val
130 135 140

Asn Lys Tyr Lys Cys Thr Ala Leu Met Leu Ala Val Cys His Gly Leu
145 150 155 160

Ser Glu Ile Val Gly Met Leu Leu Gln Gln Asn Val Asp Val Phe Ala
165 170 175

Ala Asp Ile Cys Gly Val Thr Ala Glu His Tyr Ala Val Thr Cys Gly
180 185 190

Phe His His Ile His Glu Gln Ile Met Glu Tyr Ile Arg Lys Leu Ser
195 200 205

Lys Asn His Gln Asn Thr Asn Pro Glu Gly Thr Ser Ala Gly Thr Pro
210 215 220

Asp Glu Ala Ala Pro Leu Ala Glu Arg Thr Pro Asp Thr Ala Glu Ser
225 230 235 240

Leu Val Glu Lys Thr Pro Asp Glu Ala Ala Pro Leu Val Glu Arg Thr
245 250 255

Pro Asp Thr Ala Glu Ser Leu Val Glu Lys Thr Pro Asp Glu Ala Ala
260 265 270

Ser Leu Val Glu Gly Thr Ser Asp Lys Ile Gln Cys Leu Glu Lys Ala
275 280 285

Thr Ser Gly Lys Phe Glu Gln Ser Ala Glu Glu Thr Pro Arg Glu Ile
290 295 300

Thr Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Pro Ala Lys
305 310 315 320

Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Asp Thr Pro Arg
325 330 335

Glu Ile Met Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Ala
340 345 350

Ala Lys Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Thr Pro
355 360 365

Val Lys Thr Gly Cys Val Ala Arg Val Thr Ser Asn Lys Thr Lys Val
370 375 380

Leu Glu Lys Gly Arg Ser Lys Met Ile Ala Cys Pro Thr Lys Glu
385 390 395



<210> 573 

<211> 1349 

<212> PRT 

<213> Homo sapiens 
<400> 573

Met Gln His His His His His His His Thr Lys Arg Lys Lys Thr Ile
5 10 15

Asn Leu Asn Ile Gln Asp Ala Gln Lys Arg Thr Ala Leu His Trp Ala
20 25 30

Cys Val Asn Gly His Glu Glu Val Val Thr Phe Leu Val Asp Arg Lys
35 40 45

Cys Gln Pro Asp Val Leu Asp Gly Glu His Arg Thr Pro Leu Met Lys
50 55 60

Ala Leu Gln Cys His Gln Glu Ala Cys Ala Asn Ile Leu Ile Asp Ser
65 70 75 80

Gly Ala Asp Ile Asn Leu Val Asp Val Tyr Gly Asn Met Ala Leu His
85 90 95

Tyr Ala Val Tyr Ser Glu Ile Leu Ser Val Val Ala Lys Leu Leu Ser
100 105 110

His Gly Ala Val Ile Glu Val His Asn Lys Ala Ser Leu Thr Pro Leu
115 120 125

Leu Leu Ser Ile Thr Lys Arg Ser Glu Gln Ile Val Glu Phe Leu Leu
130 135 140

Ile Lys Asn Ala Asn Ala Asn Ala Val Asn Lys Tyr Lys Cys Thr Ala
145 150 155 160

Leu Met Leu Ala Val Cys His Gly Leu Ser Glu Ile Val Gly Met Leu
165 170 175

Leu Gln Gln Asn Val Asp Val Phe Ala Ala Asp Ile Cys Gly Val Thr
180 185 190

Ala Glu His Tyr Ala Val Thr Cys Gly Phe His His Ile His Glu Gln
195 200 205

Ile Met Glu Tyr Ile Arg Lys Leu Ser Lys Asn His Gln Asn Thr Asn
210 215 220

Pro Glu Gly Thr Ser Ala Gly Thr Pro Asp Glu Ala Ala Pro Leu Ala
225 230 235 240

Glu Arg Thr Pro Asp Thr Ala Glu Ser Leu Val Glu Lys Thr Pro Asp
245 250 255

Glu Ala Ala Pro Leu Val Glu Arg Thr Pro Asp Thr Ala Glu Ser Leu
260 265 270

Val Glu Lys Thr Pro Asp Glu Ala Ala Ser Leu Val Glu Gly Thr Ser
275 280 285

Asp Lys Ile Gln Cys Leu Glu Lys Ala Thr Ser Gly Lys Phe Glu Gln
290 295 300
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Ser Ala Glu Glu Thr Pro Arg Glu He Thr Ser Pro Ala Lys Glu Thr 
305 310 315 320 

Ser Glu Lys Phe Thr Trp Pro Ala Lys Gly Arg Pro Arg Lys He Ala 
325 330 335 

Trp Glu Lys Lys Glu Asp Thr Pro Arg Glu He Met Ser Pro Ala Lys 
340 345 350 

Glu Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys Gly Arg Pro Arg Lys 
355 360 365 

He Ala Trp Glu Lys Lys Glu Thr Pro Val Lys Thr Gly Cys Val Ala 
370 375 380 

Arg Val Thr Ser Asn Lys Thr Lys Val Leu Glu Lys Gly Arg Ser Lys 
385 390 395 400 

Met He Ala Cys Pro Thr Lys Glu Ser Ser Thr Lys Ala Ser Ala Asn 
405 410 415 

Asp Gin Arg Phe Pro Ser Glu Ser Lys Gin Glu Glu Asp Glu Glu Tyr 
420 425 430 

Ser Cys Asp Ser Arg Ser Leu Phe Glu Ser Ser Ala Lys He Gin Val 
435 440 445 

Cys He Pro Glu Ser He Tyr Gin Lys Val Met Glu He Asn Arg Glu 
450 455 460 

Val Glu Glu Pro Pro Lys Lys Pro Ser Ala Phe Lys Pro Ala He Glu 
465 470 475 480 

Met Gin Asn Ser Val Pro Asn Lys Ala Phe Glu Leu Lys Asn Glu Gin 
485 490 495 

Thr Leu Arg Ala Asp Pro Met Phe Pro Pro Glu Ser Lys Gin Lys Asp 
500 505 510 

Tyr Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr Val Ser 
515 520 525 

Gin Lys Asp Val Cys Leu Pro Lys Ala Thr His Gin Lys Glu He Asp 
530 . 535 540 

Lys He Asn Gly Lys Leu Glu Glu Ser Pro Asn Lys Asp Gly Leu Leu 
545 550 555 560 

Lys Ala Thr Cys Gly Met Lys Val Ser He Pro Thr Lys Ala Leu Glu 
565 570 575 

Leu Lys Asp Met Gin Thr Phe Lys Ala Glu Pro Pro Gly Lys Pro Ser 
580 585 590 

Ala Phe Glu Pro Ala Thr Glu Met Gin Lys Ser Val Pro Asn Lys Ala 
595 600 605 

Leu Glu Leu Lys Asn Glu Gin Thr Trp Arg Ala Asp Glu He Leu Pro 
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610 615 620 

Ser Glu Ser Lys Gln Lys Asp Tyr Glu Glu Asn Ser Trp Asp Thr Glu
625 630 635 640

Ser Leu Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala
645 650 655

Ala His Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Gly Ser
660 665 670

Pro Val Lys Asp Gly Leu Leu Lys Ala Asn Cys Gly Met Lys Val Ser
675 680 685

Ile Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gln Thr Phe Lys Ala
690 695 700

Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala Ile Glu Met Gln
705 710 715 720

Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu
725 730 735

Arg Ala Asp Glu Ile Leu Pro Ser Glu Ser Lys Gln Lys Asp Tyr Glu
740 745 750

Glu Ser Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr Val Ser Gln Lys
755 760 765

Asp Val Cys Leu Pro Lys Ala Thr His Gln Lys Glu Ile Asp Lys Ile
770 775 780

Asn Gly Lys Leu Glu Glu Ser Pro Asp Asn Asp Gly Phe Leu Lys Ala
785 790 795 800

Pro Cys Arg Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu Leu Met
805 810 815

Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe
820 825 830

Glu Pro Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu Glu
835 840 845

Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Gln Met Phe Pro Ser Glu
850 855 860

Ser Lys Gln Lys Lys Val Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu
865 870 875 880

Arg Glu Thr Val Ser Gln Lys Asp Val Cys Val Pro Lys Ala Thr His
885 890 895

Gln Lys Glu Met Asp Lys Ile Ser Gly Lys Leu Glu Asp Ser Thr Ser
900 905 910

Leu Ser Lys Ile Leu Asp Thr Val His Ser Cys Glu Arg Ala Arg Glu
915 920 925
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Leu Gin Lys Asp His Cys Glu Gin Arg Thr Gly Lys Met Glu Gin Met 
930 935 940 

Lys Lys Lys Phe Cys Val Leu Lys Lys Lys Leu Ser Glu Ala Lys Glu 
945 950 955 960 

lie Lys Ser Gin Leu Glu Asn Gin Lys Val Lys Trp Glu Gin Glu Leu 
965 970 975 

Cys Ser Val Arg Leu Thr Leu Asn Gin Glu Glu Glu Lys Arg Arg Asn 
980 985 990 

Ala Asp He Leu Asn Glu Lys He Arg Glu Glu Leu Gly Arg lie Glu 
995 1000 1005 

Glu Gin His Arg Lys Glu Leu Glu Val I^ys Gin Gin Leu Glu Gin Ala 
1010 1015 1020 

Leu Arg He Gin Asp He Glu Leu Lys Ser Val Glu Ser Asn Leu Asn 
1025 1030 1035 1040 

Gin Val Ser His Thr His Glu Asn Glu Asn Tyr Leu Leu His Glu Asn 
1045 1050 1055 

Cys Met Leu Lys Lys Glu He Ala Met Leu Lys Leu Glu He Ala Thr 
1060 1065 1070 

Leu Lys His Gin Tyr Gin Glu Lys Glu Asn Lys Tyr Phe Glu Asp He 
1075 1080 1085 

Lys He Leu Lys Glu Lys Asn Ala Glu Leu Gin Met Thr Leu Lys Leu 
1090 1095 1100 

Lys Glu Glu Ser Leu Thr Lys Arg Ala Ser Gin Tyr Ser Gly Gin Leu 
1105 1110 1115 1120 

Lys Val Leu He Ala Glu Asn Thr Met Leu Thr Ser Lys Leu Lys Glu 
1125 1130 1135 

Lys Gin Asp Lys Glu He Leu Glu Ala Glu He Glu Ser His His Pro 
1140 1145 1150 

Arg Leu Ala Ser Ala Val Gin Asp His Asp Gin He Val Thr Ser Arg 
1155 1160 1165 

Lys Ser Gin Glu Pro Ala Phe His He Ala Gly Asp Ala Cys Leu Gin 
1170 1175 1180 

Arg Lys Met Asn Val Asp Val Ser Ser Thr He Tyr Asn Asn Glu Val 
1185 1190 1195 1200 

Leu His Gin Pro Leu Ser Glu Ala Gin Arg Lys Ser Lys Ser Leu Lys 
1205 1210 1215 



He Asn Leu Asn Tyr Ala Gly Asp Ala Leu Arg Glu Asn Thr Leu Val 
1220 1225 1230

Ser Glu His Ala Gln Arg Asp Gln Arg Glu Thr Gln Cys Gln Met Lys
1235 1240 1245

Glu Ala Glu His Met Tyr Gln Asn Glu Gln Asp Asn Val Asn Lys His
1250 1255 1260

Thr Glu Gln Gln Glu Ser Leu Asp Gln Lys Leu Phe Gln Leu Gln Ser
1265 1270 1275 1280

Lys Asn Met Trp Leu Gln Gln Gln Leu Val His Ala His Lys Lys Ala
1285 1290 1295

Asp Asn Lys Ser Lys Ile Thr Ile Asp Ile His Phe Leu Glu Arg Lys
1300 1305 1310

Met Gln His His Leu Leu Lys Glu Lys Asn Glu Glu Ile Phe Asn Tyr
1315 1320 1325

Asn Asn His Leu Lys Asn Arg Ile Tyr Gln Tyr Glu Lys Glu Lys Ala
1330 1335 1340

Glu Thr Glu Val Ile
1345 



<210> 574 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer
<400> 574 

cacacaaaga ggaagaagac catc 24 



<210> 575 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer
<400> 575 

gattcttttg taggacatgc aatcatc 27 



<210> 576 

<211> 3720 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 1149 
<223> n = A,T,C or G
<400> 576 

atggtggcaa cactgctgtc ctatggtgca gtcatcgagg tgcaaaacaa ggctagcctc 60
acaccccttt tactggccat acagaaaaga agcaagcaaa ctgtggaatt tttactaaca 120
aaaaatgcaa atgcaaacgc atttaatgag tctaaatgca cagccctcat gcttgccata 180
tgtgaaggct catcagagat agtcggcatg cttcttcagc aaaatgttga cgtctttgct 240
gaagacatac atggaataac tgcagaacgt tatgctgctg ctcgtggagt taattacatt 300
catcaacaac ttttggaaca tatacgaaaa ttacctaaaa atcctcaaaa taccaatcca 360
gaaggaacat ctacaggaac acctgatgag gctgcaccct tggcggaaag aacacctgac 420
acggctgaaa gcttgctgga aaaaacacct gacgaggctg cacgcttggt ggagggaacg 480
tctgccaaaa ttcaatgtct ggggaaagca acatctggaa agtttgaaca gtcaacagaa 540
gaaacaccta ggaaaatttt gaggcctaca aaagaaacat ctgagaaatt ttcatggcca 600
gcaaaagaaa gatctaggaa gatcacatgg gaggaaaaag aaacatctgt aaagactgaa 660
tgcgtggcag gagtaacacc taataaaact gaagttttgg aaaaaggaac atctaatatg 720
attgcatgtc ctacaaaaga aacatctaca aaagcaagta caaatgtgga tgtgagttct 780
gtagagccta tattcagtct ttttggcaca cggactattg aaaattcaca gtgtacaaaa 840
gttgaggaag actttaatct tgctaccaag attatctcta agagtgctgc acagaattat 900
acgtgtttac ctgatgctac atatcaaaaa gatatcaaaa caataaatca caaaatagaa 960
gatcagatgt tcccatcaga atccaaacga gaggaagatg aagaatattc ttgggattct 1020
gggagtctct ttgagagttc tgcaaagact caagtgtgta tacctgagtc tatgtatcag 1080
aaagtaatgg agataaatag agaagtagaa gagcttcctg agaagccatc tgccttcaag 1140
cctgccgtng aaatgcaaaa gactgttcca aataaagcct ttgaattgaa gaatgaacaa 1200
acattgagag cagctcagat gttcccatca gaatccaaac aaaaggacga tgaagaaaat 1260
tcttgggatt ctgagagtcc ctgtgagacg gtttcacaga aggatgtgta tttacccaaa 1320
gctacacatc aaaaagaatt cgatacctta agtggaaaat tagaagagtc tcctgttaaa 1380
gatggtcttc tgaagcctac ctgtggaagg aaagtttctc ttccaaataa agccttagaa 1440
ttaaaggaca gagaaacatt caaagcagag tctcctgata aagatggtct tctgaagcct 1500
acctgtggaa ggaaagtttc tcttccaaat aaagccttag aattaaagga cagagaaaca 1560
ctcaaagcag agtctcctga taatgatggt cttctgaagc ctacctgtgg aaggaaagtt 1620
tctcttccaa ataaagcttt agaattgaag gacagagaaa cattcaaagc agctcagatg 1680
ttcccatcag aatccaaaca aaaggatgat gaagaaaatt cttgggattt tgagagtttc 1740
cttgagactc tcttacagaa tgatgtgtgt ttacccaagg ctacacatca aaaagaattc 1800
gataccttaa gtggaaaatt agaagagtct cctgataaag atggtcttct gaagcctacc 1860
tgtggaatga aaatttctct tccaaataaa gccttagaat tgaaggacag agaaacattc 1920
aaagcagagg atgtgagttc tgtagagtcc acattcagtc tttttggcaa accgactact 1980
gaaaattcac agtctacaaa agttgaggaa gactttaatc ttactaccaa ggagggagca 2040
acaaagacag taactggaca acaggaacgt gatattggca ttattgaacg agctccacaa 2100
gatcaaacaa ataagatgcc cacatcagaa ttaggaagaa aagaagatac aaaatcaact 2160
tcagattctg agattatctc tgtgagtgat acacagaatt atgagtgttt acctgaggct 2220
acatatcaaa aagaaataaa gacaacaaat ggcaaaatag aagagtctcc tgaaaagcct 2280
tctcactttg agcctgccac tgaaatgcaa aactctgttc caaataaagg cttagaatgg 2340
aagaataaac aaacattgag agcagattca actaccctat caaaaatctt ggatgcactt 2400
ccttcttgtg aaagaggaag ggaacttaaa aaagataact gtgaacaaat tacagcaaaa 2460
atggaacaaa tgaaaaataa gttttgtgta ctacaaaagg aactgtcaga agcgaaagaa 2520
ataaaatcac agttagagaa ccaaaaagct aaatgggaac aagagctctg cagtgtgaga 2580
ttgcctttaa atcaagaaga agagaagaga agaaatgtcg atatattaaa agaaaaaatt 2640
agacccgaag agcaacttag gaaaaagtta gaagtgaaac accaacttga acagactctc 2700
agaatacaag a-ka-bagaatt gaaaagtgta acaagtaatt tgaatcaggt ttctcacact 2760
catgaaagtg aaaatgatct ctttcatgaa aattgcatgt tgaaaaagga aattgccatg 2820
ctaaaactgg aagtagccac actgaaacat caacaccagg tgaaggaaaa taaatacttt 2880
gaggacatta agattttaca agaaaagaat gctgaacttc aaatgaccct aaaactgaaa 2940
cagaaaacag taacaaaaag ggcatctcag tatagagagc agcttaaagt tctgacggca 3000
gagaacacga tgctgacttc taaattgaag gaaaaacaag acaaagaaat actggagaca 3060
gaaattgaat cacaccatcc tagactggct tctgctttac aagaccatga tcaaagtgtc 3120
acatcaagaa aaaaccaaga acttgctttc cacagtgcag gagatgctcc tttgcaagga 3180
ataatgaatg ttgatgtgag taatacaata tataacaatg aggtgctcca tcaaccactt 3240
tatgaagctc aaaggaaatc caaaagccca aaaattaatc tcaattatgc aggagatgat 3300
ctaagagaaa atgcattggt ttcagaacat gcacaaagag accgatgtga aacacagtgt 3360
caaatgaaga aagctgaaca catgtatcaa aatgaacaag ataatgtgga caaacacact 3420 
gaacagcagg agtctctgga gcagaaatta tttcaactag aaagcaaaaa taggtggctt 3480 
cgacagcaat tagtttatgc acataagaaa gttaacaaaa gcaaggtaac aattaatatt 3540 
cagtttcctg agatgaaaat gcaacgtcat ctaaaagaga aaaatgagga ggtattcaat 3600 
tatggtaacc atttaaaaga acgtatagat caatatgaaa aagagaaagc agaaagagaa 3660 
gtaagtatca aaaaatataa atacttttca aacttcctga aagaaagtgg ccttggctaa 3720 

<210> 577 

<211> 1239 

<212> PRT 

<213> Homo sapiens 

<400> 577 

Met Val Ala Thr Leu Leu Ser Tyr Gly Ala Val Ile Glu Val Gln Asn
1 5 10 15

Lys Ala Ser Leu Thr Pro Leu Leu Leu Ala Ile Gln Lys Arg Ser Lys
20 25 30

Gln Thr Val Glu Phe Leu Leu Thr Lys Asn Ala Asn Ala Asn Ala Phe
35 40 45

Asn Glu Ser Lys Cys Thr Ala Leu Met Leu Ala Ile Cys Glu Gly Ser
50 55 60

Ser Glu Ile Val Gly Met Leu Leu Gln Gln Asn Val Asp Val Phe Ala
65 70 75 80

Glu Asp Ile His Gly Ile Thr Ala Glu Arg Tyr Ala Ala Ala Arg Gly
85 90 95

Val Asn Tyr Ile His Gln Gln Leu Leu Glu His Ile Arg Lys Leu Pro
100 105 110

Lys Asn Pro Gln Asn Thr Asn Pro Glu Gly Thr Ser Thr Gly Thr Pro
115 120 125

Asp Glu Ala Ala Pro Leu Ala Glu Arg Thr Pro Asp Thr Ala Glu Ser
130 135 140

Leu Leu Glu Lys Thr Pro Asp Glu Ala Ala Arg Leu Val Glu Gly Thr
145 150 155 160

Ser Ala Lys Ile Gln Cys Leu Gly Lys Ala Thr Ser Gly Lys Phe Glu
165 170 175

Gln Ser Thr Glu Glu Thr Pro Arg Lys Ile Leu Arg Pro Thr Lys Glu
180 185 190

Thr Ser Glu Lys Phe Ser Trp Pro Ala Lys Glu Arg Ser Arg Lys Ile
195 200 205

Thr Trp Glu Glu Lys Glu Thr Ser Val Lys Thr Glu Cys Val Ala Gly
210 215 220

Val Thr Pro Asn Lys Thr Glu Val Leu Glu Lys Gly Thr Ser Asn Met
225 230 235 240

Ile Ala Cys Pro Thr Lys Glu Thr Ser Thr Lys Ala Ser Thr Asn Val
245 250 255

Asp Val Ser Ser Val Glu Pro Ile Phe Ser Leu Phe Gly Thr Arg Thr
260 265 270

Ile Glu Asn Ser Gln Cys Thr Lys Val Glu Glu Asp Phe Asn Leu Ala
275 280 285

Thr Lys Ile Ile Ser Lys Ser Ala Ala Gln Asn Tyr Thr Cys Leu Pro
290 295 300

Asp Ala Thr Tyr Gln Lys Asp Ile Lys Thr Ile Asn His Lys Ile Glu
305 310 315 320

Asp Gln Met Phe Pro Ser Glu Ser Lys Arg Glu Glu Asp Glu Glu Tyr
325 330 335

Ser Trp Asp Ser Gly Ser Leu Phe Glu Ser Ser Ala Lys Thr Gln Val
340 345 350

Cys Ile Pro Glu Ser Met Tyr Gln Lys Val Met Glu Ile Asn Arg Glu
355 360 365

Val Glu Glu Leu Pro Glu Lys Pro Ser Ala Phe Lys Pro Ala Val Glu
370 375 380

Met Gln Lys Thr Val Pro Asn Lys Ala Phe Glu Leu Lys Asn Glu Gln
385 390 395 400

Thr Leu Arg Ala Ala Gln Met Phe Pro Ser Glu Ser Lys Gln Lys Asp
405 410 415

Asp Glu Glu Asn Ser Trp Asp Ser Glu Ser Pro Cys Glu Thr Val Ser
420 425 430

Gln Lys Asp Val Tyr Leu Pro Lys Ala Thr His Gln Lys Glu Phe Asp
435 440 445

Thr Leu Ser Gly Lys Leu Glu Glu Ser Pro Val Lys Asp Gly Leu Leu
450 455 460

Lys Pro Thr Cys Gly Arg Lys Val Ser Leu Pro Asn Lys Ala Leu Glu
465 470 475 480

Leu Lys Asp Arg Glu Thr Phe Lys Ala Glu Ser Pro Asp Lys Asp Gly
485 490 495

Leu Leu Lys Pro Thr Cys Gly Arg Lys Val Ser Leu Pro Asn Lys Ala
500 505 510

Leu Glu Leu Lys Asp Arg Glu Thr Leu Lys Ala Glu Ser Pro Asp Asn
515 520 525

Asp Gly Leu Leu Lys Pro Thr Cys Gly Arg Lys Val Ser Leu Pro Asn
530 535 540

Lys Ala Leu Glu Leu Lys Asp Arg Glu Thr Phe Lys Ala Ala Gln Met
545 550 555 560

Phe Pro Ser Glu Ser Lys Gln Lys Asp Asp Glu Glu Asn Ser Trp Asp
565 570 575

Phe Glu Ser Phe Leu Glu Thr Leu Leu Gln Asn Asp Val Cys Leu Pro
580 585 590

Lys Ala Thr His Gln Lys Glu Phe Asp Thr Leu Ser Gly Lys Leu Glu
595 600 605

Glu Ser Pro Asp Lys Asp Gly Leu Leu Lys Pro Thr Cys Gly Met Lys
610 615 620

Ile Ser Leu Pro Asn Lys Ala Leu Glu Leu Lys Asp Arg Glu Thr Phe
625 630 635 640

Lys Ala Glu Asp Val Ser Ser Val Glu Ser Thr Phe Ser Leu Phe Gly
645 650 655

Lys Pro Thr Thr Glu Asn Ser Gln Ser Thr Lys Val Glu Glu Asp Phe
660 665 670

Asn Leu Thr Thr Lys Glu Gly Ala Thr Lys Thr Val Thr Gly Gln Gln
675 680 685

Glu Arg Asp Ile Gly Ile Ile Glu Arg Ala Pro Gln Asp Gln Thr Asn
690 695 700

Lys Met Pro Thr Ser Glu Leu Gly Arg Lys Glu Asp Thr Lys Ser Thr
705 710 715 720

Ser Asp Ser Glu Ile Ile Ser Val Ser Asp Thr Gln Asn Tyr Glu Cys
725 730 735

Leu Pro Glu Ala Thr Tyr Gln Lys Glu Ile Lys Thr Thr Asn Gly Lys
740 745 750

Ile Glu Glu Ser Pro Glu Lys Pro Ser His Phe Glu Pro Ala Thr Glu
755 760 765

Met Gln Asn Ser Val Pro Asn Lys Gly Leu Glu Trp Lys Asn Lys Gln
770 775 780

Thr Leu Arg Ala Asp Ser Thr Thr Leu Ser Lys Ile Leu Asp Ala Leu
785 790 795 800

Pro Ser Cys Glu Arg Gly Arg Glu Leu Lys Lys Asp Asn Cys Glu Gln
805 810 815

Ile Thr Ala Lys Met Glu Gln Met Lys Asn Lys Phe Cys Val Leu Gln
820 825 830

Lys Glu Leu Ser Glu Ala Lys Glu Ile Lys Ser Gln Leu Glu Asn Gln
835 840 845

Lys Ala Lys Trp Glu Gln Glu Leu Cys Ser Val Arg Leu Pro Leu Asn
850 855 860

Gln Glu Glu Glu Lys Arg Arg Asn Val Asp Ile Leu Lys Glu Lys Ile
865 870 875 880

Arg Pro Glu Glu Gln Leu Arg Lys Lys Leu Glu Val Lys His Gln Leu
885 890 895

Glu Gln Thr Leu Arg Ile Gln Asp Ile Glu Leu Lys Ser Val Thr Ser
900 905 910

Asn Leu Asn Gln Val Ser His Thr His Glu Ser Glu Asn Asp Leu Phe
915 920 925

His Glu Asn Cys Met Leu Lys Lys Glu Ile Ala Met Leu Lys Leu Glu
930 935 940

Val Ala Thr Leu Lys His Gln His Gln Val Lys Glu Asn Lys Tyr Phe
945 950 955 960

Glu Asp Ile Lys Ile Leu Gln Glu Lys Asn Ala Glu Leu Gln Met Thr
965 970 975

Leu Lys Leu Lys Gln Lys Thr Val Thr Lys Arg Ala Ser Gln Tyr Arg
980 985 990

Glu Gln Leu Lys Val Leu Thr Ala Glu Asn Thr Met Leu Thr Ser Lys
995 1000 1005

Leu Lys Glu Lys Gln Asp Lys Glu Ile Leu Glu Thr Glu Ile Glu Ser
1010 1015 1020

His His Pro Arg Leu Ala Ser Ala Leu Gln Asp His Asp Gln Ser Val
1025 1030 1035 1040

Thr Ser Arg Lys Asn Gln Glu Leu Ala Phe His Ser Ala Gly Asp Ala
1045 1050 1055

Pro Leu Gln Gly Ile Met Asn Val Asp Val Ser Asn Thr Ile Tyr Asn
1060 1065 1070

Asn Glu Val Leu His Gln Pro Leu Tyr Glu Ala Gln Arg Lys Ser Lys
1075 1080 1085

Ser Pro Lys Ile Asn Leu Asn Tyr Ala Gly Asp Asp Leu Arg Glu Asn
1090 1095 1100

Ala Leu Val Ser Glu His Ala Gln Arg Asp Arg Cys Glu Thr Gln Cys
1105 1110 1115 1120

Gln Met Lys Lys Ala Glu His Met Tyr Gln Asn Glu Gln Asp Asn Val
1125 1130 1135

Asp Lys His Thr Glu Gln Gln Glu Ser Leu Glu Gln Lys Leu Phe Gln
1140 1145 1150

Leu Glu Ser Lys Asn Arg Trp Leu Arg Gln Gln Leu Val Tyr Ala His
1155 1160 1165

Lys Lys Val Asn Lys Ser Lys Val Thr Ile Asn Ile Gln Phe Pro Glu
1170 1175 1180

Met Lys Met Gln Arg His Leu Lys Glu Lys Asn Glu Glu Val Phe Asn
1185 1190 1195 1200

Tyr Gly Asn His Leu Lys Glu Arg Ile Asp Gln Tyr Glu Lys Glu Lys
1205 1210 1215

Ala Glu Arg Glu Val Ser Ile Lys Lys Tyr Lys Tyr Phe Ser Asn Phe
1220 1225 1230

Leu Lys Glu Ser Gly Leu Gly
1235



<210> 578 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 578

Lys Asn Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile
1 5 10 15

Tyr Gln Tyr Glu
20 



<210> 579 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 579

Glu Gln Asp Asn Val Asn Lys His Thr Glu Gln Gln Glu Ser Leu Asp
1 5 10 15

Gln Lys Leu Phe
20 



<210> 580 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 580

Thr Glu Gln Gln Glu Ser Leu Asp Gln Lys Leu Phe Gln Leu Gln Ser
1 5 10 15

Lys Asn Met Trp
20



<210> 581

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 581 

Lys Glu Glu Ser Leu Thr Lys Arg Ala Ser Gln Tyr Ser Gly Gln Leu
1 5 10 15

Lys Val Leu Ile
20 



<210> 582 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 582 

Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn Val Asp Val Ser
1 5 10 15

Ser Thr Ile Tyr
20 



<210> 583 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 583 

Arg Lys Met Asn Val Asp Val Ser Ser Thr Ile Tyr Asn Asn Glu Val
1 5 10 15

Leu His Gln Pro
20 



<210> 584 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 584 

Met Gly Thr Arg Ala Leu Gln Cys Glu Val Ser His Thr His Glu Asn
1 5 10 15

Glu Asn Tyr Leu 
20 



<210> 585 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 585

Glu Val Ser His Thr His Glu Asn Glu Asn Tyr Leu Leu His Glu Asn
1 5 10 15

Cys Met Leu Lys 
20 



<210> 586 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 586 

Glu Asn Tyr Leu Leu His Glu Asn Leu Met Leu Lys Lys Glu Ile Ala
1 5 10 15

Met Leu Lys Leu 
20 



<210> 587 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 587 

Asn Cys Met Leu Lys Lys Glu Ile Ala Met Leu Lys Leu Glu Ile Ala
1 5 10 15

Thr Leu Lys His Gln
20 



<210> 588 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 588 

Met Leu Thr Ser Lys Leu Lys Glu Lys Gln Asp Lys Glu Ile Leu Glu
1 5 10 15

Ala Glu Ile Glu
20 



<210> 589 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 589 

Lys Gln Asp Lys Glu Ile Leu Glu Ala Glu Ile Glu Ser His His Pro
1 5 10 15

Arg Leu Ala Ser
20



<210> 590

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 590 

Ala Glu Ile Glu Ser His His Pro Arg Leu Ala Ser Ala Val Gln Asp
1 5 10 15

His Asp Gln Ile
20 



<210> 591 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 591 

Arg Leu Ala Ser Ala Val Gln Asp His Asp Gln Ile Val Thr Ser Arg
1 5 10 15

Lys Ser Gln Glu
20 



<210> 592 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 592 

His Asp Gln Ile Val Thr Ser Arg Lys Ser Gln Glu Pro Ala Phe His
1 5 10 15

Ile Ala Gly Asp
20 



<210> 593 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 593 

Lys Ser Gln Glu Pro Ala Phe His Ile Ala Gly Asp Ala Cys Leu Gln
1 5 10 15

Arg Lys Met Asn
20 



<210> 594

<211> 20 

<212> PRT 

<213> Homo sapiens

<400> 594

Met Gly Thr Arg Ala Leu Gln Cys Glu Val Ser His Thr His Glu Asn
1 5 10 15

Glu Asn Tyr Leu
20



<210> 595 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 595

Ser His Thr His Glu Asn Glu Asn Tyr Leu Leu His Glu Asn Cys Met
1 5 10 15

Leu Lys Lys Glu
20



<210> 596 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 596 

Leu His Glu Asn Cys Met Leu Lys Lys Glu Ile Ala Met Leu Lys Leu
1 5 10 15

Glu Ile Ala Thr
20



<210> 597 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 597

Ile Ala Met Leu Lys Leu Glu Ile Ala Thr Leu Lys His Gln Tyr Gln
1 5 10 15

Glu Lys Glu Asn
20 



<210> 598 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 598 

Leu Lys His Gln Tyr Gln Glu Lys Glu Asn Lys Tyr Phe Glu Asp Ile
1 5 10 15

Lys Ile Leu Lys
20



<210> 599 
<211> 20 
<212> PRT

<213> Homo sapiens 

<400> 599 

Lys Tyr Phe Glu Asp Ile Lys Ile Leu Lys Glu Lys Asn Ala Glu Leu
1 5 10 15

Gln Met Thr Leu
20 



<210> 600 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 600 

Glu Lys Asn Ala Glu Leu Gln Met Thr Leu Lys Leu Lys Glu Glu Ser
1 5 10 15

Leu Thr Lys Arg 
20 



<210> 601 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 601 

Lys Leu Lys Glu Glu Ser Leu Thr Lys Arg Ala Ser Gln Tyr Ser Gly
1 5 10 15

Gln Leu Lys Val
20 



<210> 602 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 602 

Ala Ser Gln Tyr Ser Gly Gln Leu Lys Val Leu Ile Ala Glu Asn Thr
1 5 10 15

Met Leu Thr Ser 
20 



<210> 603 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 603 

Leu Ile Ala Glu Asn Thr Met Leu Thr Ser Lys Leu Lys Glu Lys Gln
1 5 10 15

Asp Lys Glu Ile
20



<210> 604
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 604 

Lys Leu Lys Glu Lys Gln Asp Lys Glu Ile Leu Glu Ala Glu Ile Glu
1 5 10 15

Ser His His Pro
20



<210> 605 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 605 

Leu Glu Ala Glu Ile Glu Ser His His Pro Arg Leu Ala Ser Ala Val
1 5 10 15

Gln Asp His Asp
20



<210> 606 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 606 

Arg Leu Ala Ser Ala Val Gln Asp His Asp Gln Ile Val Thr Ser Arg
1 5 10 15

Lys Ser Gln Glu
20 



<210> 607 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 607 

Asp Gln Ile Val Thr Ser Arg Lys Ser Gln Glu Pro Ala Phe His Ile

1 5 10 15 

Ala Gly Asp Ala Cys Leu 
20 



<210> 608 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 608 

Pro Ala Phe His Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn
1 5 10 15

Val Asp Val Ser
20



<210> 609
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 609 

Leu Gln Arg Lys Met Asn Val Asp Val Ser Ser Thr Ile Tyr Asn Asn
1 5 10 15

Glu Val Leu His
20



<210> 610 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 610 

Ser Thr Ile Tyr Asn Asn Glu Val Leu His Gln Pro Leu Ser Glu Ala
1 5 10 15

Gln Arg Lys Ser
20



<210> 611 
<211> 21 
<212> PRT 

<213> Homo sapiens 



<400> 611 

His Gln Pro Leu Ser Glu Ala Gln Arg Lys Ser Lys Ser Leu Lys Ile
1 5 10 15

Asn Leu Asn Tyr Ala 
20 



<210> 612 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 612 

Lys Ser Leu Lys Ile Asn Leu Asn Tyr Ala Gly Asp Ala Leu Arg Glu
1 5 10 15

Asn Thr Leu Val
20



<210> 613 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 613 

Gly Asp Ala Leu Arg Glu Asn Thr Leu Val Ser Glu His Ala Gln Arg
1 5 10 15

Asp Gln Arg Glu
20



<210> 614 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 614

Ser Glu His Ala Gln Arg Asp Gln Arg Glu Thr Gln Cys Gln Met Lys
1 5 10 15

Glu Ala Glu His
20 



<210> 615 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 615

Thr Gln Cys Gln Met Lys Glu Ala Glu His Met Tyr Gln Asn Glu Gln
1 5 10 15

Asp Asn Val Asn
20 



<210> 616 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 616

Met Tyr Gln Asn Glu Gln Asp Asn Val Asn Lys His Thr Glu Gln Gln
1 5 10 15

Glu Ser Leu Asp
20 



<210> 617 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 617

Lys His Thr Glu Gln Gln Glu Ser Leu Asp Gln Lys Leu Phe Gln Leu
1 5 10 15

Gln Ser Lys Asn
20 



<210> 618 
<211> 21 
<212> PRT 

<213> Homo sapiens 



<400> 618

Asp Gln Lys Leu Phe Gln Leu Gln Ser Lys Asn Met Trp Leu Gln Gln
1 5 10 15

Gln Leu Val His Ala
20



<210> 619 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 619 

Met Trp Leu Gln Gln Gln Leu Val His Ala His Lys Lys Ala Asp Asn
1 5 10 15

Lys Ser Lys Ile
20 



<210> 620 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 620 

His Lys Lys Ala Asp Asn Lys Ser Lys Ile Thr Ile Asp Ile His Phe
1 5 10 15

Leu Glu Arg Lys 
20 



<210> 621 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 621 

Thr Ile Asp Ile His Phe Leu Glu Arg Lys Met Gln His His Leu Leu
1 5 10 15

Lys Glu Lys Asn 
20 



<210> 622 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 622 

Met Gln His His Leu Leu Lys Glu Lys Asn Glu Glu Ile Phe Asn Tyr
1 5 10 15

Asn Asn His Leu
20



<210> 623 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 623

Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile Tyr Gln
1 5 10 15

Tyr Glu Lys Glu
20



<210> 624 
<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 624

Asn His Leu Lys Asn Arg Ile Tyr Gln Tyr Glu Lys Glu Lys Ala Glu
1 5 10 15

Thr Glu Val Ile
20 



<210> 625 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 625 

Leu Thr Leu Asn Gln Glu Glu Glu Lys Arg Arg Asn Ala Asp Ile Leu
1 5 10 15

Asn Glu Lys Ile Arg Glu Glu Leu Gly Cys Gly
20 25



<210> 626 
<211> 29 
<212> PRT 

<213> Homo sapiens 
<400> 626 

Ile Arg Glu Glu Leu Gly Arg Ile Glu Glu Gln His Arg Lys Glu Leu
1 5 10 15

Glu Val Lys Gln Gln Leu Glu Gln Ala Leu Gly Cys Gly
20 25



<210> 627 
<211> 24 
<212> PRT 

<213> Homo sapiens 



<400> 627

Leu Glu Gln Ala Leu Arg Ile Gln Asp Ile Glu Leu Lys Ser Val Glu
1 5 10 15

Ser Asn Leu Asn Gln Gly Cys Gly
20